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CHONDROCYTE PROTEINS 

5 

This application claims the benefit of U.S. 
Provisional Application Serial No. 60/021,672, filed 
July 5, 1996. 

This work was supported by the National Institutes 
10 of Health Grant No. AR38945. The Federal Government may 
have certain rights in the invention. 

FIELD OF THE INVENTION 

15 The present invention relates to proteins 

expressed in chondrocytes, DNA molecules encoding these 
proteins, and their uses. 



20 



BACKGROUND OF THE INVENTION 



Endochondral ossification is remarkably similar in 
diverse biological settings. The remodeling of calcified 
cartilage into bone can be found in embryonic sterna, 
; i. vertebrae, and limbs, juvenile long bone development, 

25 fracture healing by callus formation, and ectopic bone 

formation induced by bone morphogenetic proteins. The same 
process can also be found in pathologic conditions, such as 
cartilaginous neoplasms, heterotopic ossification, and 
degenerating articular cartilage. This commonality suggests 

30 that mineralizing chondrocytes are committed to the same 
innate developmental pathway. 

During the process of endochondral ossification, 
chondrocytes undergo a progression of maturational changes, 
with marked biochemical and physical changes in both the 

3 5 cells and surrounding matrix. These changes are most 

evident in the growth plate where they are spatially and 
temporally ordered (Buckwalter et al . , J. Bone and Joint 
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Surg. , 68A:243-255 (1986); Gibson et al . , Cell Biol . . 
101:277-284 {1985); and Poole, "Cartilage in Health and 
Disease", Arthrit is and Allied Conditions: A Textbook of 
Rheumatoloq-y, 279-333, (1993)). Resting chondrocytes are 
flat, irregularly- shaped nondividing cells. As these cells 
enter the cell cycle, they become arranged in columns and 
undergo the rapid proliferation necessary for long bone 
growth. Collagen fibrils in the resting and proliferating 
region of the growth plate are predominantly type II 
collagen with associated minor collagens type IX and type XI 
(Buckwalter Clin. Orthop. . 172:207-231 (1983) 
( "Buckwalter" ) ; Oshima et al . , Calcif . Tiss . Int . . 45:182- 
192 (1989) ("Oshima"); Castagnola et al . , J. Cell Biol . . 
102:2310-2317 (1986); Liu et al . , Dev. Dynamics . 198:150-157 
(1993); and Linsenmyer et al . , Development . Ill, 191-196 
(19-91)) . The matrix is characterized by an abundance of 
high molecular weight proteoglycans, which have a structural 
role in addition to preventing calcification (Buckwalter; 
Dziewiatkowski et al., Calcif . Tiss . int. . . 37:560-567 
(1985); Kosher et al . , Dev. Biol . . 118:112-117 (1986); and 
Chen et al . , Calcif. Tissue Int. . 37:395-400 (1985)). In 
the hypertrophic region of the growth plate, proliferation 
ceases and a significant increase in cell volume, up to 8- 
fold, occurs. Hypertrophic chondrocytes form arcades and 
initiate the synthesis of type X collagen, while collagen 
types II and IX and proteoglycan content decrease. In the 
most inferior part of the growth plate, adjacent to the 
metaphysis, the cartilage mineralizes. Hypertrophic 
chondrocytes in the calcified tissue may undergo apoptosis 
(Shapiro et al . , J. Bone Min. Res . . 10 (SI) :S238 (1995); 
Fujita et al . , Trans. Ann. Mta . Othop . Res. Soc . . 20:470 
(1995); and Farnum et al . , Trans. Ann. Mtcr. Othop . Res. 
Soc • • 2Q-.11 (1995)), partially convert to an osteoblastic 
phenotype (Cancedda et al . , J. Cell Biol . . 117:427-435 
(1992) ) , or remain quiescent until resorption by the 
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invading blood vessels. The signals necessary for 
calcification are poorly understood, but calcification 
appears to be effected through the production of matrix 
vesicles, which contain alkaline phosphatase, phospholipase 
A 2 , NTP-pyrophosphohydrolase, calcium, phosphate, and matrix 
metalloproteases (Dean ec al . , Calcif . Tissue Int . , 50:342- 
349 (1992); Lewinson et al . , J. Histochem. and Cytochem. , 
30:261-26 (1982); Wuthier et al., Cal . Tissue Int . , 24:163- 
171 (1977); and Watkins et al . , Biochem. Biophvs. Acta . 
631:289-304 (1980)). The calcified cartilage serves as a 
scaffold for vascular invasion and deposition of the primary 
spongiosa . 

A variety of cell culture models have been 
utilized to study the developmental changes associated with 
endochondral ossification. Embryonic chondrocytes from 
sterna (Leboy et al . , J. Biol. Chem . , 264:17281-17286 (1989) 
("Leboy") ; Sullivan et al., J. Biol . Chem. . 2 6 9:22500-22506 
(1994) ("Sullivan") ; and Bohme et al . , Exp . Ce ll Res . , 
216:191-198 (1995) ("Bohme")), and vertebra (Lian et al . , 
J. Cellular Biochem. . 52:206-219 (1993) ("Lian")), limb bud 
mesenchymal cells in micromass cultures (Roark et al . , 
Develop. Dvnam . , 200:103-116 (1994) ("Roark") and Downie et 
al . , Dev. Biol . , 162:195 (1994) ("Downie")), growth pirate 
chondrocytes in monolayer (Rosselot et al . , J . Bone Miner . 
Res . , 9:431-439 (1994) ("Rosselot"); Gelb et al . , 
Endocrinology , 127:1941-1947 (1990) ("Gelb"); andCrabbet 
al. J. Bone Mineral Res . . 5:1105-1112 (1990) ("Crab")), or 

pellet cultures (Kato et al . , Proc . Na t. Acad. Sci . , 

85:9552-9556 (1988) ("Kato")) have been used to characterize 
chondrocyte responses to exogenous factors, many of which 
function in an autocrine manner. From these studies has 
emerged a critical role for a number of growth factors, 
including bFGF, TGF/3 , IGF- 1, and PTHrP, which are present in 
the growth plate and regulate chondrocyte proliferation and 
differentiation. The expression of these factors and their 
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associated receptors are maturation dependent and 
exquisitely regulated in the growth plate (Bohme, Roark, 
Rosselot, Gelb, Crabb, and Hill et al . , Prog.' Growth Factor 
Res . . 4:45-68 (1992)). Other studies have shown that 
5 vitamins A, C, and D are also required for chondrocyte 

maturation (Leboy; Sullivan; Iwamoto et al . , Microscopy Res, 
and Technique , 28:483-491 (1994); Iwamoto et al . , Exp. Cell 
Res . . 207:413-420 (1993); Iwamoto et al . , Exp . Cell Res . . 
205:213-224 (1993); Pacifici et al . , Exp . Cell Res . . 195:38- 

10 46 (1991); Shapiro et al . , J. Bone Min. Res . . 9:1229-1237 
(1994); Corvol et al . , FEBS Lett . . 116:273-276 (1980); 
Gerstenfeld et al . , Conn . Tiss . Res . . 24:29-39 (1990); 
Schwartz et al . , J. Bone Miner. Res. . 4:199-207 (1989); and 
Suda, Calcif Tissue Int. . 37:82-90 (1985)). 

15"" Transgenic mice and human cartilage defects have 

also provided information about endochondral ossification. 
Transgenic mice with deletions of the PthrP gene show 
premature hypertrophy of growth plate chondrocytes, 
demonstrating a role for PTHrP in cell proliferation and 

20 suppression of hypertrophy (Karaplis et al . , Genes and 
Develop . . 8:227-289 (1994)). Human mutations in the 
collagens II, IX, X, and XI are the genetic bases for mild 
to severe (lethal) .cartilage dysplasias (Kivirikko et al . , 
Ann. Rev. Biochem. , 64:403-434 (1995)). Roles for sulfate 

25 transport (Hastabacka et al . , Cell , 78:1074-1087 (1994)), 
sulfate metabolism (Franco et al . , Cell , 81:15-25 (1995)), 
FGF receptor 3 (Shiang R. et al . , Cell, 78:335-42 (1994)), 
and the transcription factor SOX9 (Wagner et al . , Cell . 
79:1111-1120 (1994)) in normal cartilage development have 

30 all been demonstrated by identification of genetic defects 
in human families. 

The FGF receptor, sulfate transporters, and SOX9 
are among the few examples of cellular proteins that have 
demonstrated roles in cartilage development . As outlined 

35 above, many of the proteins with critical roles in cartilage 
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biology are either extracellular matrix proteins or 
signalling molecules. Thus, the genes and gene products 
instrumental to regulating the transition of chondrocytes 
from one stage to the next have yet to be fully 
5 characterized. Biochemical techniques used to identify 
matrix or intracellular components may not be sensitive 
enough to detect weakly or transiently expressed proteins. 
Furthermore, identification of cartilage defects in human or 
mouse mutants as a method to identify important cartilage or 
10 chondrocyte-specif ic proteins is limited by the number of 

mutants available and the labor involved in combined genetic 
and molecular approaches. 

The present invention is directed to overcoming 
these and other deficiencies in the art. 

15 

SUMMARY OF THE INVENTION 

The present invention relates to an isolated 
protein or polypeptide selectively expressed in chondrocytes 
20 in lower proliferative or upper hypertrophic zones of long 
bones and embryonic vertebrae growth plates. The encoding 
, DNA molecule, in either isolated form or incorporated in a 
heterologous (i.e. not normally containing the DNA molecule 
of the present invention) expression system or a host cell, 
25 '. is also disclosed. 

The present invention also relates to an. antibody 
or binding portion thereof or probe with recognizes the 
protein or polypeptide. 

Another aspect of the present invention relates to 
30 a method of identifying the occurrence of proliferation or 
hypertrophy of chondrocytes in a tissue sample. The sample 
is contacted with either the subject antibody, binding 
portion thereof, or probe; a nucleotide sequence of the DNA 
molecule encoding the subject protein or polypeptide as a 
35 probe in a nucleic acid hybridization assay; or a nucleotide 
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sequence of the DNA molecule encoding the subject protein or 
polypeptide as a probe in a gene amplification detection 
procedure. An assay system is used to detect any reaction 
which indicates that an isolated protein or polypeptide 
selectively expressed in chondrocytes in lower proliferative 
or upper hypertrophic zones of long bones and embryonic 
vertebrae growth plates is present in the sample. 

The present invention also relates to a method for 
preventing chondrocytes from transitioning from 
proliferation to hypertrophy and to a method for inhibiting 
arthritic progression of articular chondrocytes in a 
patient. These methods include reducing expression in the 
chondrocytes of a protein or polypeptide that is selectively 
expressed in chondrocytes in lower proliferative or upper 
hypertrophic zones of long bone and embryonic vertebrae 
growth plates. The present invention also relates to a 
method for inducing chondrocytes to transition from 
proliferation to hypertrophy and a method for treating non- 
union bone defects. These methods include increasing 
expression in the chondrocytes of a protein or a polypeptide 
selectively expressed in chondrocytes in lower proliferative 
or upper hypertrophic zones of long bone and embryonic 
vertebrae growth plates-. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1A is a Northern Blot hybridization. 
Figures IB and 1C are RNAase protection analyses. In 
Figure 1A, five micrograms of total RNA from growth plate 
and articular chondrocytes were loaded onto multiple pairs 
of lanes of a formaldehyde gel, electrophoresed , then 
transferred to GeneScreen Plus, Adjacent pairs were then' 
hybridized with three different Band 17 cDNA fragments 
labeled with 32 P. Location of probes I, II, and IV within 
Band 17 cDNAs is given in the legend for Figure 5 . 
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Figure IB shows the results of an RNAase protection analysis 
of Band 17 expression of the 2.2 and 5.0 kb transcripts in 
chicken tissue. Riboprobes from the 260 bp cDNA template 
(probe II) were hybridized to 10 total RNA prepared from 
a variety of tissues from juvenile chick. Protected RNA 
fragments were separated on denaturing acrylamide gel and 
analyzed by autoradiography. Lanes contain RNA from brain 
(B) ; articular chondrocytes (A) ; growth plate chondrocytes 
(G) , heart (H) , Kidney (K) , liver (L) , lung (N) , skeletal 
muscle (M) , skin (S) , and spleen (P) . Glyceraldehyde -3 - 
phosphate dehydrogenase ("GAPDH") is used as a control and 
is pictured under the Band 17 samples. Yeast tRNA did not 
give a protected fragment. UP designates the position of 
the undigested (full length) probe RNA (lane not shown), and 
PP designates the position of the protected band. Figure 1C 
depicts the results of a RNAase .protect ion analysis of the 
5.0 and 6.2 kb transcripts. The same samples were used as 
described with regard to Figure IB. Separate tissue RNA 
samples were hybridized to either a 5.0 kb specific cRNA 
(probe III, Figure 5), a 6.2 kb-specific cRNA probe (probe 
IV), or a GAPDH probe. Note that the GAPDH control 

:,. indicates that the liver and muscle RNAs were in significant 
excess compared to the growth plate chondrocyte sample . 

Figure 2 depicts an in situ hybridization used to 

'examine Band 17 expression in the long bone growth plates of 
6-8 week chicks and the developing bones of 18 day chick 
embryos. The sections were hybridized with a 33 P-labeled 
riboprobe that hydridizes to all Band 17 transcripts 
(Probe I in Figure 5) . Hybridization conditions were 50% 
formamide, 2XSSC at 56°C. Wash conditions were 68°C in 
0.1XSSC. Light field and dark field photomicrographs were 
taken of identical sections. R, P, and H in the light field 
photomicrographs designate the resting, proliferating, and 
hypertrophic zones of the growth plates. 
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Figure 3 is an RNAase protection analysis of Band 
17 expression performed in cultured sternal chondrocytes. 
Additions to the media were either NuSerum ("NSM") and/or 
ascorbate {"ASC") . The template for the RNA probe 
corresponds to probe I in Figure 5, and hybridizes to all 
Band 17 transcripts. Y designates the lane containing probe 
hybridized to yeast tRNA. UP and PP designate the position 
of full length probe and protected fragment. 

Figures 4A-4C show the time course of Band 17 
expression in juvenile chicken growth plate chondrocytes in 
culture. Figure 4A is an RNAase protection analysis of Band 
17 expression in growth plate ("GP") cells. Samples were 
either five /xg RNA from freshly isolated juvenile growth 
plate tissue (lane F) , five /xg RNA from enzymat ically 
released chondrocytes (lane U) , or yeast tRNA (lane Y) . The 
template for the RNA probe corresponds to probe II in 
Figure 5 and recognizes the 2.2 and 5.0 kb transcripts. 
0.25 ^g RNA was hybridized to the GAPDH probe as a loading 
control. UP and PP designate the position of full length 
probe and protected fragment . Figure 4B shows the RNAase 
protection of Band 17 expression by cultured juvenile long 
bone chondrocytes. The chondrocytes were enzymat icaiiy 
released from the matrix and plated. Sample U (unpiated) is 
RNA extracted from a cell pellet prior to plating. Lanes 1, 
2 and 3 are RNA samples extracted from chondrocytes growing 
in monolayer for 1, 2 and 3 days. Figure 4C is a Northern 
Blot analysis of the expression of collagen types II and X 
with jS-actin as a control. The sample RNA from unpiated and 
cultured chondrocytes is identical to the RNA used for Band 
17 analysis in Figure 4B. 

Figure 5 is a schematic diagram of Band 17 
sequences, showing the alternative use of exons to form the 
2.2, 5.0, and 6.2 kb cDNAs . Question marks represent 
unknown cDNA and genomic sequences. A, B , C, D, and E 
represent exons . The 5 . 0 kb transcript includes exons A-D, 



9801468A1 I > 



WO 98/01468 



PCT/US97/11311 



- 9 - 

the 6.2 kb transcript includes exons A-C, plus E. The 2.2 
kb transcript contains exons A-C and only the first part of 
exon D (D s ) . Restriction sites are labeled below the genomic 
sequence diagram; Bg=BglIII, X=XbaI, E=EcoRI, and Nc=NcoI . 
Thick bars represent cDNA fragments used as probes to 
analyze bl7 mRNA expression and genomic structure. Probe I 
is the 0.25 kb Pstl-Bglll fragment that detects all 
transcripts (nt positions 106-354 in cDNA sequence given in 
Figure 7) . Probe II is the 0.26 kb fragment that detects 
the 2.2 and 5.0 kb transcripts (nt positions 4541-4800 in 
genomic sequence, Genbank Accession No. U59420) to be 
submitted to Genbank). Probe III is the 0.41 kb fragment 
that detects only the 5.0 kb transcript (nt positions 7413- 
7837 in genomic sequence) . Probe IV is the 0.33 kb Xmnl- 
Kpnl fragment that detects only the 6.2 kb transcript (nt 
positions 634-966 in Figure 7) . Probe V is the 0 . 7 kb 
fragment used as a probe for genomic Southern Blocs (nt 
positions 4391-5089 in genomic sequence) . 

Figure 6A and 6B are genomic Southern Blots.. Ten 
p.q genomic DNA was digested with either EcoRI (E) , Bglll 
(Bg) , or Xbal (X.) and the digested fragments were separated 
on a 1%, agarose gel. The DNA was blotted to GeneScreen 
Plus, then hybridized to a random primed probe. In Figure 
6A, the blot was probed with a 700 bp fragment, 
corresponding to probe V in Figure 5. In Figure 6B, the 
same blot was stripped and reprobed with probe IV (specific 
to 6.2 kb cDNA) . The position of size standards is 
indicated on the right . 

Figure 7 shows the cDNA sequence for the 6 . 2 kb 
transcript with the predicted translation. The reading 
frame within the 5.0 and 2.2 kb transcripts is congruous 
with that of the 6.2 kb transcript to position 587, which is 
the alternative splice point. The remainder of the 5.Q kb 
transcript is depicted schematically as exon D in Figure 5 
and starts at position 3948 in the genomic sequence. 
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Relevant restriction sites are underlined and labeled. 
Potential N-glycosyslat ion sites are underlined in the amino 
acid sequence. Exons are labels in outlined letters that 
correspond to the exons shown in Figure 5. 
5 Figure 8A compares the nucleotide homology between 

the chicken bl7 sequence and combined human cDNA sequences 
from the national sequence data bank ("NCBI") The human 
sequence was derived from taking nt#l-#268 of clone c-3af01, 
Accession Number F12482, then adding 187 nt of clone 
10 c-lxbOl, starting at position 182. Numbering for the 
chicken sequence is as shown in Figure 7. Figure 8B 
compares the homology of predicted amino acid sequences for 
the nucleotide sequences given in Figure 8A. 

15^ DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to isolated DNA 
molecules encoding proteins or polypeptides selectively 
expressed in chondrocytes in lower proliferative or upper 

20 hypertrophic zones of long bone and embryonic vertebrae 
growth plates. These DNA molecules can also have the 
following characteristics: (1) expression of ihese DNA 
molecules is predominantly found in cartilage destined for 
mineralization, and their transcription products is 

25 undetectable in articular cartilage and undetectable or weak 
in kidney, liver, lung, skin, spleen, brain, heart, and 
muscle tissue; (2) expression of these DNA molecules is 
increased by induction of a hypertrophic phenotype in 
progenitor sternal chondrocytes by treatment with ascorbate; 

30 and (3) these DNA molecules are transcribed to form mRNA 

which exhibits a rapid but transient rise when hypertrophy 
is induced in growth plate chondrocytes in short term 
monolayer cultures . 

One such DNA molecule comprises the nucleotide 

3 5 sequence corresponding to SEQ. ID. No. 1 as follows: 
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GATCACTGCG ACAAGTTCGT 
GTACCAAGTG AACGCCTTCA 
AGAAGGTGGC GAGTGCCCTG 
AACAGAGCGG CCCCGGGTAC 
5 CAGTTATGCA GAGTTCCTGT 
TGCAGGGTTC GTGTGGGGCT 
AGAAAACAC C TTTGGGAGTA 
TGTACCTTTG TTTTAAGGGC 
GTGTTACCTT TTAATTCTCC 

10 GATAGTTGAA GGGAAGGGAG 
ACAGGGAGCA CATTTCCCAT 
TGTCATTTCC AGCATTCCCC 
TGCAGCTCAA ACCCAGCCAG 
ACTGCTCCAC AGTGAGCCGG 
S 15 TGTGATAGTA AGGCTATCCC 
CAGTGTGGAA TGTGTGGTTT 
AACTCTATTG GAAACTTTGT 
AACCCTAAAT TGTAGTTACT 
GAGAAAGTTG CTAGTTCTGT 

20 GGATCTATTG CAGATCTCGT 
GTTGGCTATA AAAAATGTGA 
K. AAGATGCTAA GGTAGGTGCT 
CTTCAAAACA TGCCTCACTG 
TTCTGTGCCC GTCCTTCTTC 

25 GCTGCATTGG TCTCCAAAGA 
ATACCTTCTG CATTCAGACT 
ACTAAAATGA TCTTTTCCAT 
TTTCTGTAAC ACCTGAAATG 
TTTTATATGA TAAAATGTCT 

3 0 TAGGTCCTCT GTGACTTCTT 
TTAATAAAAC TAGTAGAACC 
TTTGGAGGGA TTGAGAGATT 
CACTGCAAAA CACGTTTAGC 
TTAGCAGCCC TGCCAAAGAG 
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GGCCTTCGTG GAGGACAACG ACACAGCCAT 
AAGAGGGCCC GGAGATGAGG AAGGTGTTGG 
TGTCTGCCGG CCAGCGAGCT GAACGCAGGT 
GCTGCGCTCA GTGTGATGCG GGATGTGCTG 
CTAAAATACA AGCTGAACCA GATGCAGTCA 
GCAGTAGTGC GTGCTTGTTA GTCAACAGAA 
TCTTTCTTGG AGACGAGTGG AAGTATCAGC 
TCAGCTTTAC TTTTGCTTTG AGTTATGAGT 
TTCTGTAAAA TGTTGCAATT CAAGCATGCA 
GATGTGTCTG CGTTGTACCT TCGCTTGTCT 
GCTCAGGAAG CCCCCAGAAA TAAGCACTGC 
CAAAGATGTG ATCCTAAAAC CACGTCACGC 
CAGCATACAG GTTAAGCATG GCAGCCTGAG 
CACGCCTCCA CCTGCCCCTC TTCTGCCTTT 
AGCAGTGGGA CTATCACAGG TGCATCAGTT 
TGTTTCCCTG AGGTTTGCAT TCTGCACGAT 
TGCTTGGCAT TTGGGCTGGT GATTGTTTTC 
CGTACAAAAC CATGACAAGG GGAAAGTTGG 
GGTGGTGGTT TTATCCCTTG CTCCTTTCTT 
TCAAGTGGCT TTCCTCACTT GCTCGTATGA 
CCTCCCCGTG GTGTTCGCTC TTCAGTGAAG 
AAATG CAGAG GGCAGAGAGA TTTGAGAAGC 
TTTGGATGTT GTTTTGTGGG CAGTTGTAAG 
AACCTTCATT AGGTTTGGTG CTCCATTAGC 
GCTGTGGGTT AATCAAGCAG TAGGACTGAA 
TAAATATTGG CAGTGTCTTA ATTTGTCCTG 
TGCACACTTA ATTCATGTAA TGCTTTTTTC 
CTCTGGACAA CTTTGTTTTA CATGTATTAT 
TGATTTTAGA GGACAGCAAA TAAGGTCTTT 
TTCTGAGGCC CAACTGGTCT CTAATTCCTG 
TGGATAAATA TGACTTGCTT TGGATTACTC 
TGGGGATTAA GAATGATGCC ATTTATTTGG 
AATGCCCCTG CAGAGGCTCC TAAAGGAAGC 
AAAAACCCTG GAGTCAGGAG GAAGCGGTCT 
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CCTCTCAAAG AAGAGGAGGG TCAGCAGGAA TTTGTGCTGT TTC CTTCTAA 
TAGCTTAGTG AGAGAGGAAA GCTTGCTGAT TAAGCGGTTA CTTGGCACGT 
TAAGAATATG GGGTGTTTGA GCAGCTCTGC TGGAAGACTC TACAAGGTTG 
AATTGCCCAG CAGTGCAGTG GCAGTTGGTG TTCAGTGTGA AATTACGTGC 
5 ATGGAGTAAG AGGTTAAAGC TCCATCAGTG AGGTGGTGGG CTCTCAGATC 
CCTTTTTATT ATTTATTTAT TTATTTTCAC TGTATGCAAT AGTAAAAACT 
TGTAAACTGT GTTAACTTTA GGTACTGGAG TACCTGAATG ACCTGAAGCA 
ATACTGGAAG AGAGGATATG GCTATGACAT CAATAGTCGC TCCAGCTGCA 
TTTTATTCCA GGATATCTTC CAGCAGTTGG ACAAAGCAGT GGATGAGAGC 

10 AGAAGGTAAA TTAAAAAAAA AAAAAGGGGG GGGGGGGGGG GAAGCTTTTG 
TGTTGACTGA CTGCAAGCTT TCTGTGGTTA ATCCTGAGTT GGATTTGAGT 
AG C AGTT AAA CACTTCAGAC ACAAGAATGC TAGGAGAAGT TTGGTTAGGA 
GAACTTGTGA TTAGAGAGAA CAAAATCCTT AATAGGATCG TTACTGTAGA 
^ GTG CAAATAG GCTTGAGGTT TTATTTTTCC CATTGATGCT TTTGTGCCCA 

15* GTGGATTTAT TTCCATCTTT TAACTTACTG ATCTGCACAG GCCTTCAAAG 
GACAGCCAGT TACTGTGTCT GACAGTGGTG GTTTTTTCCT GCTGAACAAT 
GAATTTTTTG TTTAAAATGT CTTTGTTAAA AAGCATTTGT GGTGAAAGTG 
GAAAGGCTGT AGGTTAAAAA AAGCAATATG ATCGATTCTG CTTTCTGGTT 
ACTTAAACAC TTCAG CATGA AAGTCTTGTT TTCTTTCCAT GTGTGTTTGA 

2 0 CATCTCTTGC ACTATTAAAG CTTTCTGAGC TTTAAAGCTT CAGGCTGAAG 

GTGCTGAAAT GCAATTACAA AAGAATAATT ATTTCAAGTG AATCCAAACA 
CTCAGTGACC CTAGATGAGA ACTGCCTGTT GCAGAATCCA CCAAGCCTGA 
ACTGTAACAG CAAACCAGCC TTGTCATGCC TGCTTCTTTG TAACTGCAGA 
AAGACAAACT TAGGCAGTAT ACTCGGTCCC TGCACAAACA GGAGAAAGGT 
25 ACTTGAGCCC TGAGGCTGTT GTAAAAGCCT TGGTTTGTTG TACGAACATG 
AGGCCAGTAA TTTAGCCAGC CAGCCACTCT CTTAGATATT TACTTTCGCA 
" TCCTTACTCA TCTGCAGCAA AACTGCCCAT TGGGAGCAAT GCTGTAGGTG 
TAGGAAGTTG TTAGACCTCA CATGTATCTG TTAGCAGACA C AAAG AT AG C 
ACAAGCAAGA GTCTGCAGAG GAGGGTGGTC TGATGAAGTG GTTTGTGTTC 

3 0 AGCTAGTTCC ATGGTTTGGC AAGTCATTTT GTGTCAGAGA AGGAAGAACA 

GCAGTGGTAC TCCTTCCAGG AACTCTTACA GC~CCTCAAAA TTG CCTTTAA 

v 

CGTGCCTTGG AGGTACCTAT GCTTCCTTAA AAGCTAAAGA CAAGATGCCT 
GTGTTCTTGT GTGTATTGTT TACTCCTATC AGCTGCTATC AGTCGGCAGC 
GGTGATCTGT TGTAACCTAG AGAAAACAGT ATAGAAAACA AAGGCTTTAG 
3 5 TTACAGGTTT GGGTGTTTAT GTCACAAGAT TAG CTGT ATT TGCTTTCATG 
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TGCCAGTAAT AAAATTTTTG 
GCATATGGGA ATAATTTACA 
GAATTACAAA TTCATAGCTT 
CAGGTAAGCG CACAAATGTT 
5 AACAAGATTG TGTAAACATT 
TATGAGTAGC GTAAGGTGAC 
AGTAGTTGTA TGTGATGGGT 
T AGTT C T AAA TTTAGGGTAC 
AGGAACGAAA GCTGAAGTCA 

10 TTTTTACTTG ATAA C ATGTA 
ACAGATTGAA AATAGAGGTA 
TATCAAATTG TGAAGTCTTC 
CTGGCACTAC C CTGGGAT AC 
ATTCTGTAAC GCCAATCACT 

15 CATCTAGATT TTTGCTCACT 
GAACTGACTG TTAAGTCAGT 
TTGTTGTGCT TCAGTGCTGG 
CATTTTGCTA TGAACAGGCT 
AAACCAACAT AGCACCGAGG 

2 0 AAGTGG CTGT TGATAGCAAA 

CTAAATCATT CTGGTTTTTG 
S, CCTCTCATTT TGAGAGTGGT 
CCTTTAAATA AATCTCTACT 
AATAAAAAAT GCATGAGGCC 
25 TTGACAAGCA CACTGCTGTG 
GCGAAGATAG AAAATATAAA 
GTGAGCAGAC AAGCAGAGCA 
AATAACTTCT TAATGTAGCT 
TTTGCTTTTA GTGTATTTAT 

3 0 AAGTGCTACT TAGAAAATAG 

AAGTTTAATA ATGATAAAGC 
CAGTTTATTG ACAGCTCATT 
GGTGTTGGAG GCAGAGGAGT 
GAGCTTTGGC CATGGAATCA 
3 5 TCTGTAAGGA TCATCGAGTC 
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AGAGCTGCGT TAGGCTTAAA AACAGTGCAT 
AC CTG CATG A ATGTTGTTTT TCTAACAGAG 
AGTGAT CAGC CATGTGAATC AGTACCTGAG 
TACAAAAGCA CACAAAATCA AGGAGGTGAT 
GTGCCTTTAA ATGGTTCGTT GGAATCAATG 
CAAGTTCAGC TTTGATATTG ATATAGAAAA 
GTACTTACAT TGCTAGCATC CTTGGGGTTC 
TGAAGTAGGT CAAAAATTAT TTAGTGTTTC 
CTGATACTTG AAGCTATATG TGTGTATTTT 
AGAAAGCACT TTATTTTCCC CTGT CAGTTG 
GCCTTGCAAT TTTGGATCAG AGGAATGATC 
CTCCTTGGAA GAAAAGCTTC AAAAGCTGCC 
AGCCTCCAGA GGTCCCTTCC CACCTCAAGC 
TCTTACAAAG AGGACTGCGA AGAAGTTGTT 
GAGGATCTGA GTTAAATATC AACAGTGATA 
TGAAG CAGAA TTCTCAGTCA GTTGGCTTTT 
ATGCAGAGAT GCTGTGTGTT AAGCCCTCTT 
AGAACTTGTT GTAAGCTAGT TGTAAGCATG 
ACTAATTGTG AAGGAAAGGT GGGCAGAAGG 
CTCTCTGCAG CAAGCCTGGA CATTGTGCTG 
GAAATCTAAG GGCTGTCAGA GCTGTTGATC 
GGAGTCAAAG CTGTGGTTAT GCTAGATTGC 
GTATCCTTTC TTCAGCATTC TGGGAAGCTA 
ACAGGTCATT TACATCCAAC TGTGAAGAGA 
ATTGCTTCCA TATATGCTGT GTCTGCTTCT 
CAGAATGAGG AGACGAAGAG CAGATTAAAA 
AAACCCCTCT GCCCTTCTGA AGGAAAAAAA 
TGTCTCATAT AAGGAGAATA ATTAGATCTA 
TCTATGAGCA GGGAAAGCCT TTAAATCCTT 
CTTTAATTCT TAACTGTTTA TTAAGTCTGT 
TATAATTGAC AAAATCCACA TCTGTACTTC 
CAGCAGCCCC TAAATTTCTT GGGAAGAGCA 
AAAAGATTGA GATGATCTCA TCCTGTCTTA 
GAATCACAGA ATATCCCAAG TTTGGAGGGA 
CAATTGTGAT GTTTAAAACA TGTCATTTAG 



NSOOCID: <WO 9801488A1 J_> 



WO 98/01468 



PCT/US97/11311 



CAATGAGGTG TTGAGGAGAA 

AGGATGGTCC CTCCTGGTCA 

ACCCAAAATC TCCACTGGTT 

GTCCTGAGCT TCTGCTGATA 

5 TCACAAGTGA GACAATTCTA 

GCCAAAATAC AGGTATACAA 

AGAGTTTGAA CAGGCACCCT 

TGGTGTTGCA ATCCTTTTGA 

CAGTAAGCCA GGACCTCCAA 

10 GGTTTGAAGA CACCTCTAAG 

CTCCTGTGGC TGCCCTCAGA 

AGGTTGTGAT TATTTGGGAG 

GCCCTCTCAG AG CATGGAAG 

.,. GCTGTCAGTC TGCAAATAGC 

15*' GCTGTCAGTC TGCAAATAGC 

TCCCCTTCTA CGGACTGCTG 

TTCATTTGGG AAAGTTATTC 

GAGGGAGAAG TCCTCCCAAG 

CACCGAATTG TGCAGGAGAT 

20 AACCCAGGGG ATCATCTCCA 

TACAAATTTA TGTCTGGTAA 

TCCAAAGCAC ATTGTCTGAA 

GTCAAGAAAT TGACAGAAGT 

TTCAGTGATT TCTGGAATAT 

25 TGTGATATTT AAAGATGTAA 

TGTTACCCCA GACAGAAGCA 

TGGAGACCCT GGCAAAAAAT 

GCAGCCTGCT TACCCTGTGT 

ACTTACTGCC TTCTGCAAAA 

3 0 GTGGCAGTAT TAAAG CACTG 

CTGGTTCACT TAAAAGTAGG 

CACCTTGTGT CTCTTCATCT 

ATCTAGGGAG GGAGACTTAT 

CTGAGAACAA GAAACCATAC 

3 5 AGAAAATACT TGGTTGTGTA 
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GCAGTGAAGG CCAGCAGATG GATGTCTGTC 
CTGCTAGTCC CTTCTTGTTT GAAAGGAAAC 
AAAACTTGTC ACTAGAACCC ATCTAGGAGA 
AGCTGTAAAA TCAATTGTGA TCAAACATGA 
GGGATGCCTG GAGGGAAATG ACCCACAGAG 
CTGGGGTTTT CTACCTAAAC TGAGGTGCTG 
ACCCTATAAC ACCCTGTTGC TCACCATGGA 
ATTAAGCATG TGGCTCCATG AGGCTGGCAC 
ATGACAGAGT ACAACTGATG GAATCACTGA 
ACCATTGAGC CCAACCAGCT CATCCTTGAG 
GCTGCTACAC CCTCATCTCT GTT CAT TAC C 
GAAGCTTGCC TCCTCCTTCC AG C C AG GAGA 
CAATTAGTAT TTTCAGTCAA TCCAATATAT 
CAACTAAACA ACATGCCAGC GTGCTGCCAT 
CAACTAAACA ACTAGCCAGC GTGCTGCCAG 
GTCTCCCAGG GATAACTTCA GGAAAGCTGT 
CATGGCATCT GCTGCAGGAC ATACAGCTGA 
CACAGGAGAA CATCTCCCAT CCTATGGAAG 
AACCAACTGA AAAACACAAA CTTACATCCT 
GTAGTCCAAT TTTTGATAGA CAAATGTAAG 
AAGCCAAGAA AATGGGTCAA G C AAAATTT A 
GAATGATGTG AT AT ATT C AG CAAAACCGAT 
TTAAAATAAT AGCAGATGAC TTCAGAGATT 
ATTATAAAAG CAAAAATATT TGCACTGATC 
CTGGGAAGAA TCACTGTTCA GATGTGTTGT 
GGTAGTGAGT TTGTGCACAT GTG TGG AG AG 
GGAGATCTGG CAAAATTCAA AGCTGGGTGA 
GTTCTAAAGT GGGGGCTGAA GG CAT C TCAA 
CGAGCATGTA ACCCCATCCC GCAACGTCAG 
AAGGCTTGAG TACAGTCTCT ATTAGGCAAC 
TGGAAATCTA CCACCACCAA TGTAGGAGAG 
GGGGAGTGGA GATACAACTA ACAATCCTTC 
GTGGGGACCT GAAGCAATTT GAGAGTACAG 
AAAAGGAAAA TATGCATATT TTTTAGCCGT 
TGCATGTGTT ATTATGACTA TATAGTGTTA 
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TTACTATATC TTTAATGATA TAGTACAGTT CTGTATTTAA TCTGTTGCCC 
CACCTGCAGC TGTTAATTGC TCAGAAAATG AGCCTCTGTG GTGGCAAAAT 
GTTGT CTTAT TTATCCGTGT TTTAACACTG ATATATATCT CTGGTTTGTT 
CTGATACTAC AGGAAGAATG . ATTTTATTTC CAGAATCTTA CTGTTGCTCC 
AAGTTCTCCT TTTTTTTTAA AAATGAAAAG TTTAGTTTGG GCTATCCAGT 
AGCAGCTGTT GG AG CATTTG TGCTCCAGCA AGGAGTTATG GTGTCTGGCT 
TTGTGTTTCT GTTCTAGGCT TGTTGGTAGA GAATGGCATT GCCAGCTCTG 
CATTTTATAG CATATTTCAA ATATTTATAT TTAGCAGTTT GCCCCGTTTT 
CATTCCTTGT TACAGCTCAA ATAAAATGAG AGCTTTTACT TGTAACCCTT 
TTTCTTCCAT GAAGCTTTTA TTGACCCAGC AATCTGATTT CTGATTATTT 
GCCTAATTAG TTGCCTTATT AAAGCTCACT CTTCTTTCTT CTGGAAAAAG 
TACCTTCTGG AATAATGTCG GCCCTTAAGA AAATGATGAA AATTACTGAA 
ATTCTCAAGA TTTTAACTAT GAGACCATTA GAGAGTTGGT ATTTGAGTTA 
CAACTTTGAT GTCTCAGATG TGAATGTTTG GCGTCTCCAT TCTTCTGCAC 
CTTCAGTAGC AATAAAACAT TAATGTCCTG TAAAGGTTAA TTCCTTTTCT 
TTGAGACCTT ACCACTGTCA AATAGGTTCT TCCAAGACCA CATTCCTCTG 
TGTCTCCTTG CCTGTCTGTA AGGTGATACA GTGATAACGT GTCTGGGGAG 
AGTTTGAGTG CCACAACTCT CCCATAAAAA GTTTCTTATT TAGAAGAAAA 
AGGAAATAAT ATTATAGGAG TGGAGTAAAG TTAAACCAGG TGAGTTGTGC 
TAAAATGGCA TACTTGGGAA GTTGTCCAAG T C CAAAT AAA GAGCTTTATT 
TTTGTGATAA . GG AAAGGATT AAATTCTTCT CATGTCTGTC CGTTATGGAT 
l h AGCCAACAAT CAGACCATGC AACTATATGG CAAAGAAGCC AATGGGGTAA 
TACTCTTCTC TGAACTGTTG GTTTTTTTCC ATACTGGAAC CTTACAGAAA 
ATGTCCCTAC TCTTCATTAT GTGGGCAAAA CTGACAGGTA GCGATGTGCT 
TGTACTGCTG CACTTGGCGT TGTGCTGCTA TGGAAGAATC TCGAAAGGCT 
GCTCTGCATT TGATTGAAGA GTTAGTGTCC AATTTCCCAC AGTTGTGGTA 
TTTGGAGGAA GTTTTAACAG TGGTACATAG AGGAGCAATA GATGAGTGTC 
TCTCTGCCTT GGAAGAAGCT T 

Another such DNA molecule comprises - the nucleotide 
sequence corresponding to SEQ . ID. No. 2 as follows: 

GGCACGAAGG GAGGCGAGAG GATCCCGGAG CAGCTGGAGC AGGCGGCCGC 
GCCCGTCCTC CTCTTCCTGC AGCTGCCGCC ATGGCGCCGT GCCGCGCTGC 
CTGTCGTCTG CCGCTTCTGG TAGCGGTGGC GAGCGCCGGG CTGGGCGGCT 
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ACTTCGGCAC CAAGTCCCGC TACGAGGAGG TGAACCCGCA CCTGGCGGAG 
GACCCGCTGT CCCTCGGGCC GCACGCCGCC GCCGCCCGGC TGCCCGCCGC 
CTGCGCCCCG CTGCAGCTCC GCCGCGTCGT CCGCCACGGC ACCCGCTACC 
CCACGGCCGG GCAAATCCGC CGCCTGGCCG AGCTGCACGG CCGCCTCCGC 
5 CGCGCCGCCG CCCCGTCCTG CCCCGCCGCC GCCGCGCTGG CCGCCTGGCC 
GATGTGGTAC GAGGAGAGCC TCGACGGGCG GCTGGCGCCG CGGGGCCGCC 
GCGACATGGA ACACCTGGCG CGCCGCCTGG CCGCCCGCTT CCCCGCGCTC 
TTCGCCGCCC GCCGCCGCCT GGCGCTGGCC AGCAGCTCCA AGCACCGCTG 
CCTGCAGAGC GGCGCGGCCT TCCGGCGCGG CCTCGGGCCC TCCCTCAGCC 

10 TCGGCGCCGA CGAGACGGAG ATCGAAGTGA ACGACGCGCT GATGAGGTTT 
TTTGATCACT GCGACAAGTT CGTGGCCTTC GTGGAGGACA ACGACACAGC 
CATGTAC CAA GTGAACGCCT TCAAAGAGGG CCCGGAGATG AGGAAGGTGT 
TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC CGGCCAGCGA GCTGAACGCA 
GATCTCGTTC AAGTGGCTTT CCTGACTTGC TCGTATGAGT TGGCTATAAA 

15 s ' AAATGTGACC TCCCCGTGGT GTTCGCTCTT CAGTGAAGAA GATGCTAAGG 
TACTGGAGTA CCTGAATGAC CTGAAGCAAT ACTGGAAGAG AGGATATGGC 
TATGACATCA ATAGTCGCTC CAGCTGCATT TTATTCCAGG ATATCTTCCA 
GCAGTTGGAC AAAGCAGTGG ATGAGAGCAG AAGTTGACAG ATTGAAAATA 
GAGGTAGCCT TGCAATTTTG GATCAGAGGA ATGATCTATC AAATTGTGAA 

2 0 GTCTTCCTCC TTGGAAGAAA AGCTTCAAAA GCTGCCCTGG CACTACCCTG 
GGATACAGCC TCCAGAGGTC CCTTCCCACC TCAAGCATTC TGTAACGCCA 
ATGACTTCTT ACAAAGAGGA CTGCGAAGAA GTTGTTCATC TAGATTTTTG 
CTCACTGAGG ATCTGAGTTA AAT AT CAACA GTGATAGAAC TGACTGTTAA 
GTCAGTTGAA GCAGAATTCT CAGT CAGTTG GCTTTTTTGT TGTGCTTCAG 

2 5 TGCTGGATGC AGAGATGCTG TGTGTTAAGC CCTCTTCATT TTGCTATGAA 

CAGGCTAGAA CTTGTTGTAA GCTAGTTGTA AG CATG AAAC CAACATAGCA 
CCGAGGACTA ATTGTGAAGG AAAGGTGGGC AGAAGGAAGT GGCTGTTGAT 
AGCAAACTCT CTGCAGCAAG CCTGGACATT GTGCTGCTAA ATCATTCTGG 
TTTTTGGAAA TCTAAGGGCT GTCAGAGCTG TTGATCCCTC TCATTTTGAG 

3 0 AGTGGTGGAG TCAAAGCTGT GGTTATGCTA GATTGCCCTT TAAATAAATC 

TCTACTGTAT CCTTTCTTCA GCATTCTGGG AAGCTAAATA AAAAATG CAT 
GAGGCCACAG GTCATTTACA TCCAACTGTG AAGAGATTGA CAAGCACACT 
GCTGTGATTG CTTCCATATA TGCTGTGTCT GCTTCTGCGA AGATAGAAAA 
TATAAACAGA ATGAGGAGAC GAAGAGCAGA TTAAAAGTGA GCAGACAAGC 
3 5 AGAGCAAAAC CCCTCTGCCC TTCTGAAGGA AAAAAAAATA ACTTCTTAAT 
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GTAGCTTGTC TCATATAAGG AGAATAATTA GATCTATTTG CTTTTAGTGT 
ATTTATTCTA TGAGCAGGGA AAGCCTTTAA ATCCTTAAGT GCTACTTAGA 
AAATAGCTTT AATTCTTAAC TGTTTATTAA GTCTGTAAGT TTAATAATGA 
TAAAGCTATA ATTGACAAAA TCCACATCTG TACTTCCAGT TTATTGACAG 
5 CTCATTCAGC AGCCCCTAAA TTTCTTGGGA AG AG CAGGTG TTGGAGG CAG 
AGCAGTAAAA GATTGAGATG ATCTCATCCT GTCTTAGAGC TTTGGCCATG 
GAATCAGAAT CACAGAATAT CCCAAGTTTG GAGGGATCTG TAAGGAT CAT 
CGAGTCCAAT TGTGATGTTT AAAACATGTC ATTTAGCAAT GAGGTGTTGA 
GGAGAAGCAG TGAAGGCCAG CAGATGGATG TCTGTCAGGA TGGTCCCTCC 

10 TGGTCACTGC TAGTCCCTTC TTGTTTGAAA GGAAACACCC AAAATCTCCA 
CTGGTTAAAA CTTGTCACTA GAACCCATCT AGGAGAGTCC TGAGCTTCTG 
CTGATAAGCT GTAAAATCAA TTGTGATCAA ACATGATCAC AAGT GAGAC A 
ATTCTAGGGA TGCCTGGAGG GAAATGACCC ACAGAGGCCA AAATACAGGT 

A * ATACAACTGG GGTTTTCTAC CTAAACTGAG GTGCTGAGAG TTTGAACAGG 

-3.' 

15 CACCCTACCC T AT AAC AC C C TGTTGCTCAC CATGGATGGT GTTGCAATCC 
TTTTGAATTA AGCATGTGGC TCCATGAGGC TGGCACCAGT AAGCCAGGAC 
CTCCAAATGA CAGAGTACAA CTGATGGAAT CACTGAGGTT TGAAGACACC 
TCTAAGACCA TTGAGCCCAA CCAGCTCATC CTTGAG'CTCC TGTGGCTGCC 
CTCAGAGCTG CTACACCCTC ATCTCTGTTC ATTACCAGGT TGTGATTATT 

2 0 TGGGAGGAAG CTTGCCTCCT CCTTCCAGCC AGGAGAGCCC TCTCAGAGCA 
TGGAAGCAAT TAGTATTTTC AGTCAATCCA ATATATGCTG TCAGTCTGCA 
! i= AATAG € C AAC TAAACAACAT GCCAGCGTGC TGCCATGCTG TCAGTCTGCA 
AATAGCCAAC TAAACAACTA GCCAGCGTGC TGCCAGTCCC CTTCTACGGA 
CTGCTGGTCT CCCAGGGATA ACTTCAGGAA AGCTGTTTCA TTTGGGAAAG 

2 5 TTATT CCATG GCATCTGCTG CAGGACATAC AGCTGAGAGG GAGAAGTCCT 

CCCAAGCACA GGAGAACATC TCCCATCCTA TGGAAGCACC GAATTGTGCA 
GGAGATAACC AACTGAAAAA CACAAACTTA CATCCTAACC CAGGGGATCA 
TCTCCAGTAG TCCAATTTTT GATAGACAAA TGTAAGTACA AATTTATGTC 
TGGTAAAAGC CAAGAAAATG GGTCAAGCAA AATTTATCCA AAGCACATTG 

3 0 TCTGAAGAAT GATGTGATAT ATTCAGCAAA ACCGATGTCA AGAAATTGAC 

AGAAGTTTAA AAT AATAG C A GATGACTTCA GAG ATTTT CA GTGATTTCTG 
GAATATATTA TAAAAGCAAA AATATTTGCA CTGATCTGTG ATATTTAAAG 
ATGTAACTGG GAAGAATCAC TGTTCAGATG TGTTGTTGTT ACCCCAGACA 
GAAGCAGGTA GTGAGTTTGT GCACATGTGT GGAGAGTGGA GACCCTGGCA 



3NSDOCID: <WO 9801468A1 J_> 



WO 98/01468 



PCTVUS97/11311 



- 18 - 



AAAAATGGAG 


ATCTGGCAAA 


ATTCAAAGCT 


GGGTGAGCAG 


CCTGCTTACC 


CTGTGTGTTC 


TAAAGTGGGG 


GC.TGAAGGCA 


TCTCAAACTT 


ACTGCCTTCT 


GCAAAACGAG 


CATGTAAC'CC 


CATCCCGCAA 


CGTCAGGTGG 


CAGTATTAAA 


GCACTGAAGG 


CTTGAGTACA 


GTCTCTATTA 


GGCAACCTGG 


TT C ACTT AAA 


AGTAGGTGGA 


AATCTACCAC 


CACCAATGTA 


GGAGAGCACC 


TTGTGTCTCT 


TCATCTGGGG 


AGTGGAGATA 


CAACTAACAA 


TCCTTCATCT. 


AGGGAGGGAG 


ACTTATGTGG 


GGACCTGAAG 


CAATTTGAGA 


GTACAGCTGA 


GAACAAGAAA 


CCATACAAAA 


GGAAAATATG 


CATATTTTTT 


AGCCGTAGAA 


AATACTTGGT 


TGTGTATGCA 


TGTGTTATTA 


TGACTATATA 


GTG TTATT AC 


TATATCTTTA 


ATGATATAGT 


ACAGTTCTGT 


ATTTAATCTG 


TTGCCCCACC 


TGCAGCTGTT 


AATTGCTCAG 


AAAATG AG C C 


TCTGTGGTGG 


CAAAATGTTG 


TCTTATTTAT 


CCGTGTTTTA 


ACACTGATAT 


ATATCTCTGG 


TTTGTTCTGA 


T ACT AC AG G A 


AGAATGATTT 


TATTTCCAGA 


ATCTTACTGT 


TGCTCCAAGT 


TCTCCTTTTT 


TTTTAAAAAT 


GAAAAGTTTA 


GTTTGGGCTA 


TCCAGTAGCA 


GCTGTTGGAG 


CATTTGTGCT 


CCAGCAAGGA 


GTTATGGTGT 


CTGG CTTTGT 


GTTTCTGTTC 


TAGGCTTGTT 


GGTAGAGAAT 


GGCATTGCCA 


GCTCTGCATT 


TTATAG CATA 


TTTCAAATAT 


TTATATTTAG 


CAGTTTGCCC 


CGTTTTCATT 


CCTTGTTACA 


GCTCAAATAA 


AATGAGAGCT 


TTTACTTGTA 


ACCCTTTTTC 


TTCCATGAAG 


CTTTTATTGA 


CCCAGCAATC 


TGATTTCTGA 


TTATTTGCCT 


AATTAGTTGC 


CTTATTAAAG 


CTCACTCTTC 


TTTGTTCTGG 


AAAAAGTACC 


TT CTGGAATA 


ATGTCGGCCC 


TTAAGAAAAT 


GATGAAAATT 


ACTGAAATTC 


TCAAGATTTT 


AACTATGAGA 


CCATTAGAGA 


GTTGGTATTT 


GAGTTACAAC 


TTTGATGTCT 


CAGATGTGAA 


TGTTTGGCGT 


CTCCATTCTT 


CTGCACCTTC 


AGTAGCAATA 


AAACATTAAT 


GTCCTGTAAA 


GGTTAATTCC 


TTTTCTTTGA 


GACCTTACCA 


CTGTCAAATA 


GGTTCTTCCA 


AGACCACATT 


CCTCTGTGTC 


TCCTTGCCTG 


TCTGTAAGGT 


GATACAGTGA 


TAACGTGTCT 


GGGGAGAGTT 


TGAGTGCCAC 


AACTCTCCCA 


TAAAAAGTTT 


CTTATTTAGA 


AGAAAAAGGA 


AATAATATTA 


TAGGAGTGGA 


GTAAAGTTAA 


ACCAGGTGAG 


TTGTG CTAAA 


AT GG CAT ACT 


TGGGAAGTTG 


TCCAAGTCCA 


AATAAAG 







30 

This DNA molecule encodes for a protein or polypeptide 
having a molecular weight from about 34 to 40 kDa , 
preferably about 3 7 kDa, and having an amino acid sequence 
corresponding to SEQ. ID. No. 3 as follows: 

35 
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MAPCRAACLL PLLVAVASAG LGGYFGTKSR YEEVNPHLAE DPLSLGPHAA 

AARLPAACAP LQLRRWRHG TRYPTAGQIR RLAELHGRLR RAAAPSCPAA 

AALAAWPMWY EESLDGRLAP RGRRDMEHLA RRLAARFPAL FAARRRLALA 

SSSKHRCLQS GAAFRRGLGP SLSLGADETE I EVNDALMRF FDHCDKFVAF 

5 VEDNDTAMYQ VNAFKEGPEM RKVLEKVASA LCLPASELNA DLVQVAFLTC 

SYELAIKNVT SPWCSLFSEE DAKVLEYLND LKQYWKRGYG YDINSRSSCI 

LFQDIFQQLD KAVDESRS 

Another such DNA molecule comprises the nucleotide 
10 sequence corresponding to SEQ . ID. No. 4 as follows: 

GGCACGAAGG GAGGCGAGAG GATCCCGGAG CAGCTGGAGC AGGCGGCCGC 

GCCCGTCCTC CTCTTCCTGC AGCTGCCGCC ATGGCGCCGT GCCGCGCTGC 

CTGTCGTCTG CCGCTTCTGG TAGCGGTGGC GAGCGCCGGG CTGGGCGGCT 

J 15 ACTTCGGCAC CAAGTGCCGC TACGAGGAGG TGAACCCGCA CCTGGCGGAG 

GACCCGCTGT CCCTCGGGCC GCACGCCGCC GCCGCCCGGC TGCCCGCCGC 

CTGCGCCCCG CTGCAGCTCC GCCGCGTCGT CCGCCACGGC ACCCGCTACC 

CCACGGCCGG GCAAATCCGC CGCCTGGCCG AGCTGCACGG CCGCCTCCGC 

CGCGCCGCCG CCCCGTCCTG CCCCGCCGCC GCCGCGCTGG CCGCCTGGCC 

2 0 GATGTGGTAC G AGG AG AG C C TCGACGGGCG GCTGGCGCCG CGGGGCCGCC 

GCGACATGGA ACACCTGGCG CGCCGCCTGG CCGCCCGCTT CCCCGCGCTC 

K TTCGCCGCCC GCCGCCGCCT GGCGCTGGCC AGCAGCTCCA AGCACCGCTG 

CCTGCAGAGC GGCGCGGCCT TCCGGCGCGG CCTCGGGCCC TCCCTCAGCC 

TCGGCGCCGA CGAGACGGAG ATCGAAGTGA ACGACGCGCT GATGAGGTTT 

2 5 TTTGATCACT GCGACAAGTT CGTGGCCTTC GTGGAGGACA ACGACACAGC 

CATGTACCAA GTGAACGCCT T CAAAGAGGG CCCGGAGATG AGGAAGGTGT 
TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC CGGCCAGCGA GCTGAACGCA 
GATCTCGTTC AAGTGGCTTT CCTCACTTGC TCGTATGAGT TGGCTATAAA 
AAATGTG AC C TCCCCGTGGT GTTCGCTCTT CAGTGAAGAA GATGCTAAGG 

3 0 _ TACTGGAGTA CCTGAATGAC CTGAAGCAAT ACTGGAAGAG AGGATATGGC 

TATGACATCA ATAGTCGCTC CAGCTGCATT TTATTCCAGG ATATCTTCCA 
GCAGTTGGAC AAAGCAGTGG ATGAGAGCAG AAGTTGACAG ATTGAAAATA 
GAGGTAGCCT TGCAATTTTG GATCAGAGGA ATGATCTATC AAATTGTGAA 
GTCTTCCTCC TTGGAAGAAA AGCTTCAAAA GCTGCCCTGG CACTACCCTG 
3 5 GGATACAGCC TCCAGAGGTC CCTTCCCACC TCAAGCATTC TGTAACGCCA 
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ATCACTTCTT ACAAAGAGGA CTGCGAAGAA GTTGTTCATC TAGATTTTTG 
CTCACTGAGG ATCTGAGTTA AATATCAACA GTGATAGAAC TGACTGTTAA 
GTCAGTTGAA GCAGAATTCT CAGTCAGTTG GCTTTTTTGT TGTGCTTCAG 
TGCTGGATGC AGAGATGCTG TGTGTTAAGC CCTCTTCATT TTGCTATGAA 
5 CAGGCTAGAA CTTGTTGTAA GCTAGTTGTA AGCATGAAAC CAACATAGCA 
CCGAGGACTA ATTGTGAAGG AAAGGTGGGC AGAAGGAAGT GGCTGTTGAT 
AGCAAACTCT CTGCAGCAAG CCTGGACATT GTGCTGCTAA ATCATTCTGG 
TTTTTGGAAA TCTAAGGGCT GTCAGAGCTG TTGATCCCTC TCATTTTGAG 
AGTGGTGGAG TCAAAGCTGT GGTTATGCTA GATTGCCCTT TAAATAAATC 

10 TCTACTGTAT CCTTTGTTCA GCATTCTGGG AAGCTAAATA AAAAATG CAT 
GAGGCCACAG GTCATTTACA TCCAACTGTG AAGAGATTGA CAAGCACACT 
GCTGTGATTG CTTCCATATA TGCTGTGTCT GCTTCTGCGA AGATAGAAAA 
TATAAACAGA ATGAGGAGAC GAAGAGCAGA TTAAAAGTGA GCAGACAAGC 
AGAGCAAAAC CCCTCTGCCC TTCTGAAGGA AAAAAAAATA ACTTCTTAAT 

15- GTAGCTTGTC TCATATAAGG AGAATAATTA GATCTATTTG CTTTTAGTGT 
ATTTATTCTA TGAGCAGGGA AAGCCTTTAA ATCCTTAAGT GCTACTTAGA 
AAATAGCTTT AATTCTTAAC TGTTTATTAA GTCTGTAAGT TTAATAATGA 
TAAAGCTATA ATTGACAAAA TCCACATCTG TACTTCCAGT TTATTGACAG 
CTCATTCAGC AGCCCCTAAA TTTCTTGGGA AGAGCAGGTG TTGGAGGCAG 

20 AGC AG TAAAA GATTGAGATG ATCTCATCCT GTCTTAGAGC TTTGGCCATG 
GAATCAGAAT CACAGAATAT CCCAAGTTTG GAG 

This DNA molecule also encodes for a protein or polypeptide 
having a molecular weight of from about 34 to about 40 kDa, 
25 preferably about 3 7 kDa, and an amino acid sequence 
corresponding to SEQ . ID. No. 3 as provided above. 

Another such DNA molecule comprises the nucleotide 
sequence corresponding to SEQ. ID. No. 5 as follows: 

3 0 ATGGCGCCGT GCCGCGCTGC CTGTCGTCTG CCGCTTCTGG TAGCGGTGGC 
GAGCGCCGGG CTGGGCGGCT ACTTCGGCAC CAAGTCCCGC TACGAGGAGG 
TGAACCCGCA CCTGGCGGAG GACCCGCTGT CCCTCGGGCC GCACGCCGCC 
GCCGCCCGGC TGCCCGCCGC CTGCGCCCCG CTGCAGCTCC GCCGCGTCGT 
CCGCCACGGC ACCCGCTACC CCACGGCCGG GCAAATCCGC CGCCTGGCCG 

35 AGCTGCACGG CCGCCTCCGC CGCGCCGCCG CCCCGTCCTG CCCCGCCGCC 
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GCCGCGCTGG CCGCCTGGCC GATGTGGTAC GAGGAGAGCC TCGACGGGCG 
GCTGGCGCCG CGGGGCCGCC GCGACATGGA ACACCTGGCG CGCCGCCTGG 
CCGCCCGCTT CCCCGCGCTC TTCGCCGCCC GCCGCCGCCT GGCGCTGGCC 
AGCAGCTCCA AGCACCGCTG CCTGCAGAGC GGCGCGGCCT TCCGGCGCGG 
CCTCGGGCCC TCCCTCAGCC TCGGCGCCGA CGAGACGGAG ATGGAAGTGA 
ACGACGCGCT GATGAGGTTT TTTGATCACT GCGACAAGTT CGTGGCCTTC 
GTGGAGGACA ACGACACAGC CATGTACCAA GTGAACGCCT TCAAAGAGGG 
CCCGGAGATG AGGAAGGTGT TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC 
CGGCCAGCGA GCTGAACGCA GATCTCGTTC AAGTGGCTTT CCTCACTTGC 
TCGTATGAGT TGGCTATAAA AAATGTGACC TCCCCGTGGT GTTCGCTCTT 
CAGTGAAGAA GATGCTAAGG TACTGGAGTA CCTGAATGAC CTGAAGCAAT 
ACTGGAAGAG AGGATATGGC TATGACATCA ATAGTCGCTC CAGCTGCATT 
TTATTCCAGG ATATCTTCCA GCAGTTGGAC AAAGCAGTGG ATGAGAGCAG 
AAGT 

This DNA molecule also encodes for a protein or polypeptide 
having a molecular weight of from about 34 to about 4 0 kDa , 
preferably about 37 kDa, and an amino acid sequence 
corresponding to SEQ. ID. No. 3 as provided above. 

Another such DNA molecule comprises the nucleotide 
sequence corresponding to SEQ. ID. No. 6 as follows: 

GGCACGAAGG GAGGCGAGAG GATCCCGGAG CAGCTGGAGC AGGCGGCCGC 

GCCCGTCCTC CTCTTCCTGC AGCTGCCGCC ATGGCGCCGT GCCGCGCTGC 

CTGTCGTCTG CCGCTTCTGG TAGCGGTGGC GAGCGCCGGG CTGGGCGGCT 

ACTTCGGCAC CAAGTCCCGC TACGAGGAGG TGAACCCGCA CCTGGCGGAG 

GACCCGCTGT CCCTCGGGCC GCACGCCGCC GCCGCCCGGC TGCCCGCCGC 

CTGCGCCCCG CTGCAGCTCC GCCGCGTCGT CCGCCACGGC ACCCGCTACC 

CCACGGCCGG GCAAATCCGC CGCCTGGCCG AGCTGCACGG CCGCCTCCGC 

CGCGCCGCCG CCCCGTCCTG CCCCGCCGCC GCCGCGCTGG CCGCCTGGCC 

GATGTGGTAC GAGGAGAGCC TCGACGGGCG GCTGGCGCCG CGGGGCCGCC 

GCGACATGGA ACACCTGGCG CGCCGCCTGG CCGCCCGCTT CCCCGCGCTC 

TTCGCCGCCC GCCGCCGCCT GGCGCTGGCC AGCAGCTCCA AGCACCGCTG 

CCTGCAGAGC GGCGCGGCCT TCCGGCGCGG CCTCGGGCCC TCCCTCAGCC 
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TCGGCGCCGA CGAGACGGAG ATCGAAGTGA ACGACGCGCT GATGAGGTTT 
TTTGATCACT GCGACAAGTT CGTGGCCTTC GTGGAGGACA ACGACACAGC 
CATGTACCAA GTGAACGCCT TCAAAGAGGG CCCGGAGATG AGGAAGGTGT 
TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC CGGCCAGCGA GCTGAACGCA 
5 GATCTCGTTC AAGTGGCTTT CCTCACTTGC TCGTATGAGT TGGCTATAAA 
AAATGTGACC TCCCCGTGGT GTTCGCTCTT CAGTGAAGAA GATGCTAAGG 
TACTGGAGTA CCTGAATGAC CTGAAGCAAT ACTGGAAGAG AGGATATGGC 
TATGACATCA ATAGTCGCTC CAGCTGCATT TTATTCCAGG ATATCTTCCA 
GCAGTTGGAC AAAGCAGTGG ATGAGAGCAG AAGTTCCAGG ATATCTTCCA 

10 GCAGTTGGAC AAAGCAGTGG ATGAGAGCAG AAGTTCAAAA CCCATTTCTT 
CACCTTTGAT TGTACAAGTT GGACATGCAG AAACACTTCA GCCACTTCTT 
GCTCTTATGG GCTACTTCAA AGATGCTGAG CCTCTCCAGG CCAACAATTA 
CATCCGCCAG GCGCATCGGA AGTTCCGCAG CGGCCGGATA GTGCCTTATG 
CAGCCAACCT GGTGTTTGTG CTGTACCACT GTGAGCAGAA GACCTCTAAG 

15*'' GAGGAGTACC AAGTGCAGAT GTTGCTGAAT GAAAAGCCAA TGCTCTTTCA 
TCACTCGAAT GAAA C CAT CT CCACGTATGC AGACCTCAAG AGCTATTACA 
AGGACATCCT TCAAAACTGT CACTTCGAAG AAGTGTGTGA ATTGCCCAAA 
GTCAATGGTA CCGTTGCTGA CGAACTTTGA GGGAATGAAA TGGAGTGGCC 
GATTTGGAAA CCGATCTCAG TTTTCTTCAA CAGATGTTGT GAACGAGCAC 

2 0 TTTGGATGCA ATGCTGCTGC TGTGCCGACT CTCTAAGCTC GCAGATTTGA 

CGGCCGTTAT TTACCTGGG TTGTCTCTGTC AGCTCAA 

This DNA molecule encodes for a peptide having a molecular 
weight of from about 47 to about 53 kDa , preferably about 5G 
25 kDa, and has an amino acid sequence corresponding to SEQ . 
ID. No. 7 as follows: 

MAPCRAACLL PLLVAVASAG LGGYFGTKSR YEEVNPHLAE DPLSLGPHAA 
AARLPAACAP LQLRRWRHG TRYPTAGQIR RLAELHGRLR RAAAPSCPAA 

3 0 AALAAWPMWY EESLDGRLAP RGRRDMEHLA RRLAARFPAL FAARRRLALA 

SSSKHRCLQS GAAFRRGLGP SLSLGADETE IEWDALMRF FDHCDKFVAF 
VEDNDTAMYQ VNAFKEGPEM RKVLEKVASA LCLPASELNA DLVQVAFLTC 
SYELAIKNVT SPWCSLFSEE DAKVLEYLNB LKQYWKRGYG YDINSRSSCI 
LFQDIFQQLD KAVDESRSSK PISSPLIVQV GHAETLQPLL ALMGYFKDAE 
35 PLQANNYIRQ AHRKFRSGRI VPYAANLVFV LYHCEQKTSK EEYQVQMLLN 
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EKPMLFHHSN ETISTYADLK SYYKDILQNC HFEEVCELPK VNGTVADEL 

Another such DNA molecule comprises the nucleotide 
sequence corresponding to SEQ. ID. No. 8 as follows: 

5 

ATGGCGCCGT GCCGCGCTGC CTGTCGTCTG CCGCTTCTGG TAGCGGTGGC 
GAGCGCCGGG CTGGGCGGCT ACTTCGGCAC CAAGTCCCGC TACGAGGAGG 
TGAACCCGCA CCTGGCGGAG GACCCGCTGT CCCTCGGGCC GCACGCCGCC 
GCCGCCCGGC TGCCCGCCGC CTGCGCCCCG CTGCAGCTCC GCCGCGTCGT 

10 CCGCCACGGC ACCCGCTACC CCACGGCCGG GCAAATCCGC CGCCTGGCCG 
AGCTGCACGG CCGCCTCCGC CGCGCCGCCG CCCCGTCCTG CCCCGCCGCC 
GCCGCGCTGG CCGCCTGGCC GATGTGGTAC GAGGAGAGCC TCGACGGGCG 
GCTGGCGCCG CGGGGCCGCC GCGACATGGA ACACCTGGCG CGCCGCCTGG 
CCGCCCGCTT CCCCGCGCTC TTCGCCGCCC GCCGCCGCCT GGCGCTGGCC 

15 AGCAGCTCCA AGCACCGCTG CCTGCAGAGC GGCGCGGCCT TCCGGCGCGG 
CCTCGGGCCC TCCCTCAGCC TCGGCGCCGA CGAGACGGAG ATCGAAGTGA 
ACGACGCGCT GATGAGGTTT TTTGATCACT GCGACAAGTT CGTGGCCTTC . 
GTGGAGGACA ACGACACAGC CATGTACCAA GTGAACGCCT TCAAAGAGGG 
CCCGGAGATG AGGAAGGTGT TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC 

2 0 CGGCCAGCGA GCTGAACGCA GATCTCGTTC AAGTGGCTTT CCTCACTTGC 
TCGTATGAGT TGGCTATAAA AAATGTGACC TCCCCGTGGT G.TTCGCTCTT 
. CAGTGAAGAA GATGCTAAGG TACTGGAGTA CCTGAATGAC CTGAAGCAAT 
ACTGGAAGAG AGGATATGGC TATGACATCA ATAGTCGCTC CAGCTGCATT 
. TTATTCCAGG ATATCTTCCA GCAGTTGGAC AAAGCAGTGG ATGAGAG CAG 

2 5 AAGTTCAAAA CCCATTTCTT CACCTTTGAT TGTACAAGTT GGACATGCAG 

AAACACTTCA GCCACTTCTT GCTCTTATGG GCTACTTCAA AGATGCTGAG 
CCTCTCCAGG C CAACAATTA CATCCGCCAG GCGCATCGGA AGTTCCGCAG 
CGGCCGGATA GTGCCTTATG CAGCCAACCT GGTGTTTGTG CTGTACCACT 
GTGAGCAGAA GACCTCTAAG GAGGAGTACC AAGTGCAGAT GTTGCTGAAT 

3 0 GAAAAGCCAA TGCTCTTTCA TCACTCGAAT GAAACCATCT CCACGTATGC 

AGACCTCAAG AGCTATTACA AGGACATCCT TCAAAACTGT CACTTCGAAG 
AAGTGTGTGA ATTGCCCAAA GTCAATGGTA CCGTTGCTGA CGAACTT 

This DNA molecule also encodes for a protein or polypeptide 
35 having a molecular weight of from about 47 to about 53 kDa , 
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preferably about 5 0 kDa, and an amino acid sequence 
corresponding to SEQ. ID. No. 7 as provided above. 

Also encompassed by the present invention are 
fragments of the DNA molecules of the present invention. 
These fragments are constructed by using appropriate 
restriction sites, revealed by inspection of the DNA 
molecules sequence to, for example, delete various internal 
portions of the encoded protein. Alternatively, the 
sequence can be used to amplify any portion of the coding 
region, such that it can be cloned into a vector supplying 
both transcription and translation start signals. 

Variants may also (or alternatively) be modified 
by, for example, the deletion" or addition of nucleotides 
that have minimal influence on the properties, secondary 
15 *' structure, and hydropathic nature of the encoded 

polypeptide. For example, the nucleotides encoding a 
polypeptide may be altered so that the encoded polypeptide 
is conjugated to a linker or other sequence for ease of 
synthesis, purification, or identification of the 
20 polypeptide. 

The protein or polypeptide of the present 
invention is preferably produced in purified form 
(preferably, at least about 80%, more preferably 90%, pure) 
by conventional techniques. Typically, the protein or 
25 polypeptide of the present invention is isolated by 

homongenizing a host cell in which the protein is expressed, 
centrifuging to remove cellular debris, and precipitating 
the desired protein, such as with ammonium sulfate. The 
fraction containing the proteins of the present invention 
can be subjected affinity chromatography, ion ■ exchange , or 
gel filtration to separate the protein. Optionally, the 
protein can be further purified by high performance liquid 
chromatography ("HPLC'Mor fast protein liquid chromatography 
( "FPLC" ) . 



30 
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Any one of the DNA molecules encoding for a 
protein or polypeptide selectively expressed in chondrocytes 
in lower proliferative or upper hypertrophic zones of long 
bone and embryonic vertebrae growth plates can be 
5 incorporated in cells using conventional recombinant DNA 

technology. Generally, this involves inserting the selected 
DNA molecule into an expression system to which that DNA 
molecule is heterologous (i.e. not normally present) . The 
heterologous DNA molecule is inserted into the expression 
10 system or vector in proper orientation and correct reading 
frame. The vector contains the necessary elements for the 
transcription and translation of the inserted protein-coding 
sequences . 

U.S. Patent No. 4,237,224 to Cohen and Boyer, 
r l5 which is hereby incorporated by reference, describes the 

production of expression systems in the form of recombinant 
plasmids using restriction enzyme cleavage and ligation with 
DNA ligase. These recombinant plasmids are then introduced 
by means of transformation and replicated in unicellular 

2 0 cultures including procaryotic organisms and eukaryotic 

cells grown in tissue culture, 
j Recombinant genes may also be introduced into 

viruses, such as vaccina virus. Recombinant viruses can be 
generated by transfection of plasmids into cells infected 

25 with virus . 

Suitable vectors include, but are not limited to, 
the following viral vectors such as lambda vector system 
gtll, gt WES.tB, Charon 4, and plasmid vectors such as 
pRO-EX (Gibco/BRL) , pBR322, pBR325, pACYC177, pACYC184, 

30 pUC8, pUC9, pUC18, pUC19, pLG339, pR290, pKC37, pKClOl, SV 

40, pBluescript II SK +/- or KS +/- '(see "Stratagene Cloning 
Systems" Catalog (1993) from Stratagene, La Jolla, Calif, 
which is hereby incorporated by reference), pQE , pIH821, 
pGEX, pET series (see F.W. Studier et . al . , "Use of T7 RNA 

3 5 Polymerase to Direct Expression of Cloned Genes," Gene 



3NSDOCI0: <WO 9801468A1 J_> 



WO 98/01468 



PCTAJS97/11311 



- 26 - 

Expression Technology vol. 185 (1990), which is hereby 
incorporated by reference) and any derivatives thereof. 
Recombinant molecules can be introduced into cells via 
transformation, particularly transduction, conjugation, 
5 mobilization, or electroporation . The DNA sequences are 

cloned into the vector using standard cloning procedures in 
the art, as described by Maniatis et al . , Molecular Cloning: 
A Laboratory Manual , Cold Springs Laboratory, Cold Springs 
Harbor, New York (1982), which is hereby incorporated by 
10 reference. 

A variety of host-vector systems may be utilized 
to express the protein-encoding sequence (s) . Primarily, the 
vector system must be compatible with the host cell used. 
Host -vector systems include but are not limited to the 

15 r ' following: bacteria transformed with bacteriophage DNA, 
plasmid DNA, or cosmid DNA; microorganisms such as yeast 
containing yeast vectors; mammalian cell systems infected 
with virus (e.g., vaccinia virus, adenovirus, etc.) or 
stably transfected with an expression vector,- and insect 

20 cell systems infected with virus (e.g., baculovirus) . The 
expression elements of these vectors vary in their strength 
and specificities. Depending upon the host -vector system 
utilized, any one of a number of suitable transcription and 
translation elements can be used. 

25 Different genetic signals and processing events 

control many levels of gene expression (e.g., DNA 
transcription and messenger RNA {mRNA) translation) . 

Transcription of DNA is dependent upon the 
presence of a promoter which is a DNA sequence that directs 

30 the binding of RNA polymerase and thereby promotes mRNA 

synthesis. The DNA sequences of eukaryotic promoters differ 
from those of prokaryotic promoters. Furthermore, 
eucaryotic promoters and accompanying genetic signals may 
not be recognized in or may not function in a procaryotic 
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system, and, further, procaryotic promoters are not 
recognized and do not function in eucaryotic cells. 

Similarly, translation of mRNA in prokaryote 
depends upon the presence of the proper procaryotic signals 
5 which differ from those of eukaryotes. Efficient 

translation of mRNA in prokaryote requires a ribosome 
binding site called the Shine-Dalgarno ("SD") sequence on 
the mRNA. This sequence is a short nucleotide sequence of 
mRNA that is located before the start codon, usually AUG, 
10 which encodes the amino- terminal methionine of the protein. 
The SD sequences are complementary to the 3 ' -end of the 16S 
rRNA (ribosomal RNA) and probably promote binding of mRNA to 
ribosomes by duplexing with the rRNA to allow correct 
positioning of the ribosome. For a review on maximizing 
3 15 gene expression, see Roberts and Lauer, Methods in 

EnzymoloQv , 68:473 (1979), which is hereby incorporated by 
reference. 

Promoters vary in their "strength" (i.e. their 
ability to promote transcription) . For the purposes of 

20 expressing a cloned gene, it is desirable to use strong 

promoters in order to obtain a high level of transcription 
and, hence, expression of the gene. Depending upon the host 
cell system utilized, any one of a number of suitable 
promoters may be used. For instance, when cloning in E. 

25 coli, its bacteriophages, or plasmids, promoters such as the 
T7 phage promoter, lac promoter, trp promoter, recA 
promoter, ribosomal RNA promoter, the P R and P L promoters of 
coliphage lambda and others, including but not limited, to 
2acUV5, ompF, bla, lpp, and the like, may be used to direct 

30 high levels of transcription of adjacent DNA segments. 

Additionally, a hybrid trp-2acUV5 ( tac) promoter or other E . 
coli promoters produced by recombinant DNA or other 
synthetic DNA techniques may be used to provide for 
transcription of the inserted gene . 
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Bacterial host cell strains and expression vectors 
may be chosen, which inhibit the action of the promoter 
unless specifically induced. In certain operon, the 
addition of specific inducers is necessary for efficient 
5 transcription of the inserted DNA. For example, the lac 
operon is induced by the addition of lactose or IPTG 
( isopropylthio-beta-D-galactoside) . A variety of other 
operon, such as trp, pro, etc., are under different 
controls . 

10 Specific initiation signals are also required for 

efficient gene transcription and translation in procaryotic 
cells. These transcription and translation initiation 
signals may vary in "strength" as measured by the quantity 
of gene specific messenger RNA and protein synthesized, 

15 s ' respectively. The DNA expression vector, which contains a 
promoter, may also contain any combination of various 
"strong" transcription and/or translation initiation 
signals. For instance, efficient translation in E. coli 
requires a Shine-Dalgarno ("SD") sequence about 7-9 bases 5' 

20 to the initiation codon (ATG) to provide a ribosome binding 
site. Thus, any SD-ATG combination that can be utilized by 
host cell ribosomes may be employed. Additionally, any SD- 
ATG combination produced by recombinant DNA or other 
techniques involving incorporation of synthetic nucleotides 

25 may be used. 

Once the desired isolated DNA molecule encoding an 
isolated protein or polypeptide selectively expressed in 
chondrocytes in lower proliferative or upper hypertrophic 
zones of long bone and embryonic vertebrae growth plates has 

30 been cloned into an expression system, it is ready to be 
incorporated into a host cell. Such incorporation can be 
carried out by the various forms of transformation noted 
above, depending upon the vector/host cell system. Suitable 
host cells include, but are not limited to, bacteria, virus, 

35 yeast, mammalian cells, and the like. 
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Generally there are numerous genes differentially 
expressed within the, growth plate. However, genes 
selectively expressing proteins or polypeptides in 
chondrocytes of lower proliferative or upper hypertrophic 
zones of long bone and embryonic vertebrae growth plates are 
very rare. In view of the present invention's determination 
of nucleotide sequences corresponding to proteins which are 
selectively expressed in chondrocytes in lower proliferative 
or upper hypertrophic zones, and further in view of the 
importance of lower proliferative or upper hypertrophic zone 
chondrocytes in normal bone development and the deleterious 
affects of chondrocytes proliferation and hypertrophy in- 
certain osteopathic syndromes, such as arthritis, the 
molecular basis for chondrocyte proliferation and 
hypertrophy is suggested. With this information and the 
above -described recombinant DNA technology, a wide variety 
of therapeutic and prophylactic agents for inducing or 
preventing chondrocyte transition from proliferation to 
hypertrophy can be developed. In addition, the present 
invention permits the development of diagnostic procedures 
for identifying the occurrence of proliferation or 
hypertrophy or the transition of chondrocytes from 
proliferation to hypertrophy in a tissue sample. 

For example, the proteins or polypeptides of the 
present invention can be used to raise antibodies or binding 
portions thereof. These antibodies are useful in diagnostic 
assays for the identification of the occurrence of 
proliferation or hypertrophy of chondrocytes in a tissue 
sample . 

Antibodies suitable for use in identifying the 
occurrence of proliferation or hypertrophy of chondrocytes 
in a tissue sample can be monoclonal or polyclonal . 
Monoclonal antibody production may be effected by techniques 
which are well-known in the art. Basically, the process 
involves first obtaining immune cells (lymphocytes) from the 
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spleen of a mammal (e.g., mouse) which has been previously- 
immunized with the antigen of interest (i.e. the protein or 
peptide of the present invention) either in vivo or in 
vitro. The ant ibody- secreting lymphocytes are then fused 
5 with (mouse) myeloma cells or transformed cells, which are 

capable of replicating indefinitely in cell culture, thereby 
producing an immortal, immunoglobul in- secret ing cell line. 
The resulting fused cells, or hybridomas, are cultured and 
the resulting colonies screened for the production of the 

10 desired monoclonal antibodies. Colonies producing such 

antibodies are cloned, and grown either in vivo or in vitro 
to produce large quantities of antibody. A description of 
the theoretical basis and practical methodology of fusing 
such cells is set forth in Kohler and Milstein, Nature 

15 256:495 (1975), which is hereby incorporated by reference. 

Mammalian lymphocytes are immunized by in vivo 
immunization of the animal (e.g., a mouse) with one of the 
proteins or polypeptides of the present invention. Such 
immunizations are repeated as necessary at intervals of up 

20 to several weeks to obtain a sufficient titer of antibodies. 
Appropriate solutions or adjuvants are used as carriers. 
Following the last antigen boost, the animals are sacrificed 
and spleen cells removed. 

Fusion with mammalian myeloma cells or other 

25 fusion partners capable of replicating indefinitely in cell 
culture is effected by standard and well-known techniques, 
for example, by using polyethylene glycol (PEG) or other 
fusing agents (See Milstein and Kohler, Eur. J. Immunol . 
6:511 (1976) , which is hereby incorporated by reference) . 

30 This immortal cell line, which is preferably murine, but may 
also be derived from cells of other mammalian species, 
including but not limited to rats and humans, is selected to 
be deficient in enzymes necessary for the utilization of 
certain nutrients, to be capable of rapid growth and to have 

35 good fusion capability. Many such cell lines are known to 
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those skilled in the art, and others are regularly 
described. 

Procedures for raising polyclonal antibodies are 
also well known. Typically, such antibodies can be raised 
5 by administering one of the proteins or polypeptides of the 
present invention subcutaneously to New Zealand white 
rabbits which have first been bled to obtain pre - immune 
serum. The antigens can be injected at a total volume of 
100 (il per site at six different sites. Each injected 

10 material will contain synthetic surfactant adjuvant pluronic 
polyols, or pulverized acrylamide gel containing the protein 
or polypeptide after SDS -polyacrylamide gel electrophoresis. 
The rabbits are then bled two weeks after the first 
injection and periodically boosted with the same antigen 

fs three times every six weeks. A sample of serum is then 

collected 10 days after each boost. Polyclonal antibodies 
are then' recovered from the serum by affinity chromatography 
using the corresponding antigen to capture the antibody. 
Ultimately, the rabbits are euthanized with pentobarbitol 

20 150 mg/Kg IV. This and other procedures for raising 

polyclonal antibodies are disclosed in E. Harlow, et . al . , 
, editors, Antibodies: A Laboratory Manual (1988), which is 
hereby incorporated . by reference. 

In addition to utilizing whole antibodies, the 

25 processes of the present invention encompass use of binding 
portions of such antibodies. Such antibody fragments can be 
made by conventional procedures, such as proteolytic 
fragmentation procedures, as described in J. Goding, 
Monoclonal Antibodies: Principles and Practice , pp. 98-118 

30 (New York: Academic Press (1983) , which is hereby 
incorporated by reference. 

A variety of different types of assay systems can 
be used in practicing the method of the present invention. 
In one embodiment, the assay system has a sandwich or 

35 competitive format. Examples of suitable assays include an 
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enzyme-linked immunoadsorbant assay, a radioimmunoassay, a 
gel diffusion precipitation reaction assay, an 
immunodiffusion assay, an agglutination assay, a fluorescent 
immunoassay, a protein A immunoassay, or an 
Immunoelectrophoresis assay. 

In an alternative diagnostic embodiment of the 
present invention, the nucleotide sequences of the isolated 
DNA molecules of the present invention may be used as a 
probe in nucleic acid hybridization assays for identifying 
the occurrence of chondrocytes proliferation or hypertrophy 
in a tissue sample. The nucleotide sequences of the present 
invention may be used in any nucleic acid hybridization 
assay system known in the art, including Southern Blots 
(Southern, J . Mol . Biol . , 98:508 (1975), which is hereby 
incorporated by reference); Northern Blots (Thomas et al . , 
Proc . Nat'l Acad. Sci . USA , 77:5201-05 (1980), which is 
hereby incorporated by reference) ; RNAase protection assay 
systems (Yang et al . , Dev. Biol . , 135:53-65 (1989) ("Yang"), 
which is hereby incorporated by reference) , and Colony blots 
(Grunstein et al . , Proc. Nat'l Acad, Sci. USA , 72:3961-65 
(1975) , which is hereby incorporated by reference) . 
Alternatively, the isolated DNA molecules of the present 
invention can be used in a gene amplification detection 
procedure (e.g., a polymerase chain reaction) . See 
H.A. Erlich et. al . , "Recent Advances in the -Polymerase 
Chain Reaction", Science 252:1643-51 (1991), which is hereby 
incorporated by reference. 

More generally, the molecular basis suggested 
herein for the transition of chondrocytes from proliferation 
to hypertrophy can be used to prevent chondrocytes from 
transitioning from proliferation to hypertrophy. This 
transition can be prevented by reducing expression of the 
protein or polypeptide of the present invention in the 
chondrocytes, such as, for example, by introducing an 
antisense or ribozyme construct into the cell. An antisense 
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construct blocks translation of mRNA-encoding the protein or 
polypeptide of the present invention, thereby reducing 
expression of the protein. A ribozyme construct cleaves the 
mRNA encoding the protein or polypeptide of the present 
invention, thus, also preventing expression of functional 
protein. In addition, for decreasing in vivo expression of 
the protein or the polypeptide of the present invention, 
various gene therapy techniques can also be utilized to 
introduce the antisense or ribozyme construct into the 
chondrocytes. Details regarding the introduction of 
antisense or ribozyme construct into cells for gene therapy 
can be found in, for example, Christof f ersen, J. Medicinal 
Chemistry , 38:2023-2037 (1995), Rossi, British Medical 
Bulletin , 51:217-225 (1995), and Kiehntopf et al . , Lancet . 
345 (8956) : 1027-1031 (1995), which are hereby incorporated by 
reference. 

This technology can also be used to treat a wide 
variety of diseases caused by undesired chondrocyte 
proliferation or hypertrophy or undesired chondrocytes 
transition from proliferation to hypertrophy. For example, 
by reducing expression of the protein or polypeptide of the 
present . invention in the chondrocytes, arthritic progression 
of articular chondrocytes can be inhibited. This is 
achieved by administering to a patient an effective amount 
of an antibody, binding portion thereof, or probe 
recognizing proteins or polypeptide selectively expressed in 
chondrocytes in lower proliferative or upper hypertrophic 
zones of long bones and embryonic vertebrae growth plates. 
The antibody, binding portion thereof, or probe can be 
administered orally, parenterally, for example, 
subcutaneously, intravenously, intramuscularly, 
intraperitoneally, by intranasal instillation, or by 
application to mucous membranes, such as, that of the nose, 
throat, and bronchial tubes.. They may be administered alone 
or with suitable pharmaceutical carriers, and can be in 
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solid or liquid form such as, tablets, capsules, powders, 
solutions, suspensions, or emulsions. 

The solid unit dosage forms can be of the 
conventional type. The solid form can be a capsule, such as 
5 an ordinary gelatin type containing the antibodies or 

binding portions thereof of the present invention and a 
carrier, for example, lubricants and inert fillers, such as 
lactose, sucrose, or cornstarch. In another embodiment, 
these compounds are tableted with conventional tablet bases 

10 such as lactose, sucrose, or cornstarch, in combination with 
binders, like acacia, cornstarch, or gelatin, disintegrating 
agents, such as cornstarch, potato starch, or alginic acid, 
and a lubricant, like stearic acid or magnesium stearate. 

The antibodies or binding portions thereof of this 

15 * invention can also be administered in injectable dosages by 
solution or suspension of these materials in a 
physiologically acceptable diluent with a pharmaceutical 
carrier. Such carriers include sterile liquids, such as 
water and oils, with or without the addition of a surfactant 

20 and other pharmaceut ically acceptable adjuvants. 

Illustrative oils are those of petroleum, animal, vegetable, 
or,, synthetic origin, for example, peanut oil , soybean oil, 
or mineral oil. In. general, water, saline, aqueous dextrose 
and related sugar solution, and glycols, such as propylene 

25 glycol or polyethylene glycol, are preferred liquid 
carriers, particularly for injectable solutions. 

For use as aerosols, the antibodies or binding 
portions thereof of the present invention in solution or 
suspension may be packaged in a pressurized aerosol 

30 container together with suitable propellants, for example, 
hydrocarbon propellants like propane, butane, or isobutane 
with conventional adjuvants. The materials of the present 
invention also may be administered in a non-pressurized 
form, such as in a nebulizer or atomizer. 
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The present invention can also be used for 
treating bone growth defects, such as non-union bone 
defects, by increasing expression of a protein or a 
polypeptide which is expressed selectively in chondrocytes 
5 in lower proliferative or upper hypertrophic zones of long 
bone and embryonic vertebrae growth plates . This can be 
achieved by administering an effective amount of a protein 
or polypeptide of the present invention to the patient 
suffering one or more of these conditions. Alternatively, 

10 these conditions can be treated by administering an 

effective amount of an expression system comprising a DNA 
molecule encoding a protein or polypeptide of the present 
invention to the patient. The proteins and expression 
systems used to treat these bone growth defects can be 

^15 administered by the routes and in the forms discussed above 
with respect to administration of antibodies. 

The biological role of the protein, though not 
known for certain, is believed to be that of a phosphatase, 
although the disclosure of this biological role is not 

20 intended to be in any way limiting and should not be 

construed as a limitation on the uses to which this protein 
I,, may be put . In view of the potential phosphatase activity, 
specific inhibitors or activators of this putative 
phosphatase can be used to treat the diseases outlined 

25 above . 

The following examples are provided to illustrate 
embodiments of the present invention but are by no means 
intended to limit its scope. 



3 0 EXAMPLES 

Example 1 -.- Materials and Methods 

Growth. Plate and Articular Chondrocyte Isolation. 

Chondrocytes were isolated as described in O'Keefe et al . , 
35 J. Bone and Joint Surg... , 71A:607-620 (1989), which is hereby 
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incorporated by reference. Briefly, 3 to 5 week old chicks 
were sacrificed in a C0 2 canister, and the long bones of the 
legs dissected free of soft tissue. Cartilaginous tissue 
from both the proximal and distal growth plates of both long 
bones of each leg, or of the knee joint articular surfaces, 
were dissected and placed in modified F-12 medium 
(magnesium-free, 0.5 mM CaCl 2 , penicillin 100 units/ml, 
streptomycin 100 mg/ml) and sequentially digested with 
trypsin, hyaluronidase , and collagenase as described. The 
washed cells were either extracted directly for RNA or 
plated at subconf luent density in Dulbecco' s Minimal 
Essential Medium ( " DMEM " ) with 5% fetal bovine serum. 

Sternal Chondrocyte Isolation. Cranial and caudal 
sternal chondrocytes were isolated and cultured as described 
in Leboy, which is hereby incorporated by reference. Cells 
were released from the cranial and caudal thirds of 
embryonic day 14 chick sterna by trypsin digestion and 
cultured under standard conditions for 5 days. At the end 
of this primary culture period, the floating cell population 
was greater than 95% chondrocytic and was placed in 
secondary culture with DMEM plus 10% NuSerum (Sullivan, 
which is hereby incorporated by reference.) For culture 
under serum-free conditions, the secondary cultures were 
switched after 24 hours to DMEM supplemented with 60 ng/ml 
insulin and 10 pM tri - iodothyronine (Bohme et. al . , J. Cell 
Biol . , 116:1035-42 (1992), which is hereby incorporated by 
reference) . The ascorbate concentration in test cultures 
was increased gradually to prevent dedif f erentiat ion of the 
cells . 

RNA Isolation. RNA was purified by extraction 
with RNAzol B (Tel -Test, Inc.) according to the 
manufacturer's directions. Uncultured chondrocytes were 
collected by centrif ugat ion (1500g, 5 min) , washed in 
phosphate-buffered saline ("PBS"), and respun. RNAzol B was 
added to the cell pellet in the amount of 0.2 ml per 10 6 
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cells and immediately mixed by vortexing. Cultured 
chondrocytes were washed twice with cold PBS, then extracted 
with 2.5 ml RNAzol B per 100 mm dish by passage through a 
pipette. Yields of RNA were approximately 5 pig total RNA 
per million growth plate chondrocytes, 2-3 pig RNA per 
million articular chondrocytes, and 20 pig RNA per million 
sternal chondrocytes. Fresh growth plate tissue was frozen 
and then pulverized with a mortar and pestle in liquid 
nitrogen. The pulverized tissue was then extracted by 
mincing with a Polytron in RNAzol on ice. Poly A+ RNA was 
prepared by two consecutive passes of the RNA over an oligo 
dT-cellulose column as described in Maniatis et al . , 
Molecular Cloning: A Laboratory Manual , New York : Cold 
Spring Harbor Press, (1982) ("Maniatis"), which is hereby 
incorporated by reference) , reextracted with organic 
solvents, and precipitated with ethanol . 

RNA Blot Analysis. RNA analysis on Northern Blots 
was performed using morpholinepropanesulf onic acid ("MOPS") 
(200 mM MOPS, 50 mM NaOAc, lOmM EDTA, pH 7 . 0) -buf f ered 
formaldehyde (2.2 M) agarose gels as described in Maniatis, 
which is hereby incorporated by reference. 5-10 pig of total 
RNA or 0.5 pig of polyA+ RNA was denatured in 
formamide/ formaldehyde and electrophoresed . The gel was 
stained with 0.25 pig/ml Ethidium bromide for 5 minutes, 
destained for 1 hr with several changes of distilled water, 
and photographed, and the RNA was transferred to Gene Screen 
Plus (DuPont- NEN, Boston, MA) using an overnight capillary 
transfer with 10X SSC. rRNA bands and size standards were 
visualized on the paper (via Ethidium Bromide staining) , and 
their locations were marked for reference after 
autoradiography. 

RNA blots were stripped according the 
manufacturer's instructions (DuPont- NEN, Boston, MA). 
Chicken glyceraldehyde- 3 -phosphate dehydrogenase ( " GAPDH" ) 
was used as a probe to standardize loading for Northern and 
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RNAase protection analyses. The chicken GAPDH was cloned 
out of the growth plate cDNA library using the rat GAPDH 
fragment (Ambion) as a probe. The chicken GAPDH sequence 
used as a probe corresponds to nucleotides 265-533 of the 
5 rat GAPDH cDNA (Genbank accession number M17701) . For the 
experiment in Figure 4, a 1.45 kb human /3-actin cDNA was 
used as control (Gunning et al . , Mol . Cell Biol . , 3:787-795 
(1983) , which is hereby incorporated by reference) . 

RNAase protection assays. DNA fragments that 

10 served as templates for riboprobe production were cloned 

into either the SK" or SK* Bluescript vectors (Stratagene) . 
RNA probes were synthesized to a specific activity of 1 x 10 s 
dpm/jig in the presence of (alpha- 32 P) uridine triphosphate 
("DTP") using T7 or T3 RNA polymerase (Yang, which is hereby 

15 * incorporated by reference) . 

Growth plate or articular chondrocyte RNA and 
yeast tRNA were hybridized with an excess of the 32 P- labeled 
probe (300 pg) in a volume of 20 y.1 at 50°C in 50% 
f ormamide/4 0 mM 1 , 4 -piperazinebis (ethane-sulf onic acid 

20 ("PIPES"), pH 6.7/0.5 M NaCl/l mM EDTA for 16-20 hours. The 
RNA: RNA hybrids were treated with RNAase s A and Tl , 
extracted with phenol/chloroform, precipitated, and then 
collected by centrifugation. Protected RNA fragments were 
separated on 4 or 5% polyacrylamide gels, then displayed by 

25 autoradiography. 

Differential display of growth plate and articular 
chondrocyte gene expression. Following the original 
protocol described in Liang et al . , Science , 257:967-971 
(1992) ("Liang"), which is hereby incorporated by 

30 reference) , polyA + RNA from articular and growth plate 

chondrocytes was collected and validated by Northern Blot 
hybridization to type II and type X collagen probes. 0.5 pig 
polyA* RNA was reverse transcribed using Superscript reverse 
transcriptase (Gibco/BRL) , and 2.5 /iM T U CA as a primer, in a 

35 volume of 20 pil . Two fil of the cDNA was then amplified 
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using 2.5 units of Tag polymerase (Promega) with 2 0 dNTP 
and Q.5jiM ( alpha - 35 S ) dATP in a volume of 20 jil . The PCR 
conditions were: 1) 94°C for 30 sec, 42°C for 1 min, 72°C 
for 30 sec for 40 cycles and 2) 94°C for 30 sec, 42°C for 1 
min, 72 °C for 5 min for 1 cycle. Two m1 of this RT-PCR mix 
was electrophoresed on a 6% denaturing acrylamide gel, and 
the amplified bands were displayed by autoradiography of the 
dried gel . 

The differentially amplified Band 17 was recovered 
by a method suggested by P. Liang. The area of the gel that 
corresponded to the differentially expressed band was 
excised with a scalpel,' placed into 200 pil water for 15 min 
at 22°C, then incubated at 100°C for 15 min. After 
microfuging 10 minutes, the supernatant was transferred to 
another tube, glycogen was added to 400 fig/ml, sodium 
acetate to 0.3M, and 3 volumes of ethanol was used to 
precipitate the DNA overnight at -70°C. The primary 
amplified bands were recovered by centrif ugation . The dried 
DNA pellet was resuspended in 15 /xl lOmM Tris-lmM EDTA <TE) . 

Reamplif ication of the differentially expressed 
cDNA was performed with primers that had restriction sites 
added to the original T n CA and 10-mer oligonucleotides. The 
original 3' end primer was S'-T u CA-3'; the primer for 
reamplif ication was 5 ' -CCGCGGATCCT^CA-3 ' , thus inserting a 
BamHI site in the amplified fragment. The original 5' end 
primer was 5 ' -CTTGATTGCC-3 ' ; the primer for reamplif ication 
was 5 ' -CCGCGAATTCCTTGATTGCC-3 ' , thus inserting . an EcoRI site 
at the other side of the amplified fragment. The yield from 
the second amplification is 150 to 300 ng DNA. The added 
restriction sites facilitated cloning into phagemid and M13 
vectors, which was done by standard protocols (Ausubel et 
al . , Current Protocols in Molecular Biology . New York: John 
Wiley and Sons (1987) ("Ausubel"), which is hereby 
incorporated by reference) . 
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In Situ Hybridization. Sections were treated with 
a modification of the protocol described in Angerer et al . , 
"In Situ Hybridization with RNA Probes: An Annotated 
Recipe, " in In Situ Hybridization: Applications to 
5 Neurobiology , Valentino, ed., New York:Oxford University 
Press, pp. 42-70 (1987) , which is hereby incorporated by 
reference. Tissue sections were treated for 3 0 min at 37°C 
with 1 ptg/ml proteinase K, washed and dipped in fresh 0.25% 
acetic- anhydride in 0.1 M triethanolamine (pH 8.0) for 10 

10 min. After dehydration through a series of ethanol washes, 
the sections were dried and hybridized overnight at 56°C in 
50% formamide, 0.3 M NaCl , 10m M Tris-Cl (pH 8.0), 1 raM 
EDTA, IX Denhardts solution, 10% Dextran sulfate, 0.5 mg/ml 
yeast tRNA, and 0.3 /zg/ml probe. Riboprobes were generated 

15 as above . 

The slides were washed twice in a solution 
containing 0.15 M NaCl, 0.015 M trisodium citrate ("IX SSC") 
for 10 min and once for 40 min. Slides were treated with 
RNAase A (20 fig /ml in RNAase buffer (0.5 M NaCl, 10 mM Tris- 

20 CI and 1 mM EDTA, pH 7.5) for 30 min. at 37°C, then passed 
through 30 minute washes of RNase buffer at 37°C, 0 . IX SSC 
at:,,room temperature, 0 . IX SSC at 68°C, and 0 . IX SSC at room 
temperature. The slides were dehydrated, dried, and coated 
with nitroblue tetrazolium ("NBT2") emulsion for 

25 autoradiography. Exposure times were 17 days. Slides were 
developed, counterstained with hematoxylin and eosin, and 
coverslipped with an organic solvent -based mounting 
solution, such as Permount . 

cDNA and genomic library screening. Double 

30 stranded DNA fragments were labeled with { alpha - 32 P -) dCTP 

(New England Nuclear) using the Megaprime random priming kit 
from Amerst&m according to the manufacturer's directions. 
Specific activities or -:.he various probes were 1.0- to 6.0 x 
10 a cpm/ptg . These probes were used for hybridization to 

35 Northern Blots, Southern Blots, and cDNA library filters, ac 
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a concentration of 0.5 to 1 x 10 s cpm/ml hybridization 
solution. 

Two chicken growth plate cDNA libraries and one 
chicken genomic library were used for obtaining Band 17 
sequences. In a typical screening, a library was plated at 
30,000 plaques per 150 mm petri plate. Phage DNA was 
immobilized on Colony Plaque Screen (Dupont-NEN, Boston, MA) 
and probed according to the manufacturers' instructions. 
Two filters were used per plate. Prehybridizat ion was 
performed for 1-3 hours in 5 ml of prehybridizat ion buffer 
per filter (6X SSC, 1% SDS, 5X Denhardt ' s solution, 10% 
Dextran sulfate., and 100 jig /ml denatured salmon sperm DNA) . 
Denatured, random-primed probe was added and the filters 
were hybridized 16-20 hours at 60°C. The final wash was in 
0.1X SSC, 0.1% SDS at 60°C. Autoradiography was carried out 
for 1-3 days at -70°C using two intensifying screens. 

Plaques hybridizing to the probe were purified 
through more rounds of screening. Phagemid cDNA was 
"Zapped" out employing an M13 helper phage R408 (Stratagene) 
according to the manufacturer's instructions. Phagemids 
harboring the largest overlapping inserts were selected for 
k- sequence analysis. Genomic DNA was recovered by preparation 
of lambda DNA {Ausubel, which is hereby incorporated by 
reference) and subsequent subcloning into the SK- vector. 

Sequence analysis. Sequence analysis was 
performed by the chain termination method described in 
Sanger, Proc. Nat. Acad. Sci . USA , 74:5463-5467 (1977), 
which is hereby incorporated by reference, as modified in 
Biggin et al . , Proc. Nat. Acad. Sci. USA , 80:3963-3965 
(1983), which is hereby incorporated by reference, for use 
with the ( alpha - 35 S -) dATP and T7 polymerase (Sequenase from 
U.S. Biochemical) . Sequences were read and recorded 
manually, then entered into a VAX computer and analyzed 
using the GCG programs ( Program Manual for the Wisconsin 
Package . Wisconsin : Genet ics Computer Group, (1994), which is 
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hereby incorporated by reference) . Comparison of Band 17 
sequence with the national data bank used the BLAST search 
program disclosed in Altschul et al . , J . Mol . Biol . , 
215:403-410 (1990), which is hereby incorporated by 
5 reference. 

Example 2 -- Identification of Band 17 

The differential display technique described in 
Liang, which is hereby incorporated by reference, was used 

10 to amplify cDNAs from growth plate and articular 

chondrocytes from juvenile chicks. PolyA'RNAs were prepared 
from enzymat ically released growth plate and epiphyseal 
chondrocytes and were used as a templates for reverse 
., transcription and subsequent PCR. Band 17 was originally 

15 " amplified as a 260 nucleotide cDNA that was displayed only 
in PCR products from growth plate chondrocytes. The cDNA 
was reamplified and cloned into Stratagene vector SK" to 
facilitate, further analysis. The 260 bp Band 17 cDNA 
detected two transcripts of 2.2 and 5.0 kb on Northern Blots 

20 of growth plate RNA (Figure 1A, probe II, Lane G) . Neither 
transcript was detectable on Northern Blots of articular 
chondrocyte RNA (Figure 1A, probe II, Lane A) . RNAase 
protection using the 260 nt RNA antisense probe confirmed 
that Band 17 is strongly expressed in growth plate 

25 chondrocytes (Figure IB, lane G) and undetectable in 
articular chondrocytes (Figure IB, lane A) . 

Example 3 -- Band 17 Transcripts 

As the cloning of Band 17 cDNA proceeded, 

30 additional transcripts of 6.2 kb and 1.7 kb were detected by 
Northern Blot hybridization of cDNA probes from the 5' end 
of Band 17 (Figure 1A, probe I) . The 6.2 kb transcript is 
significantly greater in abundance than the 5.0, 2.2, and 
1.7 kb transcripts and is the result of alternative splicing 

35 (see below, and Figure 5 for location of probes and splice 
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site) . cDNA probes from the 5' side of the alternative 
splice site detect the 6.2, 5.0, 2.2, and 1 . 7 kb transcripts 
(e.g., probe I in Figure 1A) . Probes from the alternative 
3' ends of Band 17 detect either the 5.0 and 2.2 kb 
transcripts (Figure 1A, probe II), the 6.2 kb (Figure 1A, 
probe IV) , or the 5.0 kb transcript. None of the Band 17 
transcripts are detectable in articular chondrocyte RNA 
(Figure 1A, Lanes A) . The 1.7 kb transcript was only 
detected by cDNA probes from the 5' side of the splice site, 
and may include additional 5' and/or 3' exons not yet 
cloned . 

RNAase protection demonstrates that the 6.2, 5.0, 
and 2.2 kb Band 17 transcripts show the same specificity for 
the growth plate (Figures IB and 1C) . The RNAase 
protections were performed with cRNAs that detect either the 
2.2 and 5.0 kb transcripts (probe II), the 5.0 transcript 
(probe III) , or the 6.2 kb transcript (probe IV) . Compared 
to expression in the growth plate, Band 17 is weakly 
expressed in kidney (K) , liver (L) , lung (N) , skin (S) , and 
spleen (P) . Expression was not detected in brain (B) , 
arcicular. chondrocytes (A) , heart (H) , and muscle (M) . 

Example 4 -- Band 17 Localization 

In situ hybridization demonstrated that Band 17 
message is restricted to the lower prolif erative/upper 
hypertrophic region of the juvenile growth plate (Figure 2, 
A-D) . A similar pattern of expression for Band 17 was seen 
in embryonic vertebrae, in which Band 17 is expressed at the 
border of proliferating and hypertrophic cells (Figure 2, E, 
F) . In contrast to the expression of type X collagen 
(Oshima; Leboy et al . , J. Biol . Chem . , 263:8515-8520 (1988); 
and Luvalle et al . , Dev . Biol . , 133:613-616 (1989), which 
are hereby incorporated by reference), Band 17 expression is 
not found throughout the hypertrophic zone. Band 17 was not 
detected elsewhere in the embryo, including developing limbs 
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that had no hypertrophic cells. This suggests not only that 
Band 17 is expressed specifically in chondrocytes destined 
for mineralization (Figure 1) but also that Band 17 is 
expressed in a spatially limited region where chondrocytes 
5 are exiting the cell cycle and beginning hypertrophic 

differentiation (Figure 2) . The role for Band 17 in the 
transition from proliferation to differentiation has been 
corroborated through the use of two chondrocyte culture 
model systems. 

10 

Example 5 -- Temporal Expression of Band 17 

Cultured upper sternal chondrocytes from late 
chick embryos have been widely used as an in vitro model of 
chondrocyte differentiation. Ascorbate treatment of 

15 " cultured sternal chondrocytes results in steady increase of 
type X collagen and alkaline phosphatase, eventually leading 
to calcification of the matrix. Type X mRNA and alkaline 
phosphatase activity both increase approximately 14 fold 
over nontreated controls during a 7 day period. 

20 Concomitantly, collagen types II and IX decrease gradually, 
showing a greater rate of decrease in cells treated with 
ascorbate (Leboy, which is hereby incorporated by 
reference) . Ascorbate induces the hypertrophic phenotype in 
these cells in a manner independent of ascorbate 's effect on 

25 collagen processing (Sullivan, which is hereby incorporated 
by reference) . Ascorbate induced Band 17 mRNA at least 5 
fold over a 2-3 day period (Figure 3) in chondrocytes 
cultured either with (lanes 3 and 4) or without (lanes 1 and 
2) serum. The increase in Band 17 message during short term 

3 0 culture suggests, as does the in situ hybridization data, 
that Band 17 functions during the initial stages of 
hypertrophy as opposed to the later mineralization state. 
Band 17 mRNA appeared to be induced slightly more than type 
X message over the same duration (Leboy, which is hereby 

35 incorporated by reference) , suggesting that Band 17 
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expression is initiated no later than the initiation of type 
X synthesis . 

Band 17 expression was also examined in monolayer 
cultures of juvenile (3 to 5 week old) chick chondrocytes, 
cells that are more differentiated than those found in 
embryonic chick sternum. Monolayer cultures of growth plate 
chondrocytes derived from juvenile chickens showed rapid 
increases in Type X collagen message and protein in the 24 
hours after plating. This effect was seen in cells derived 
from all zones of the growth plate, indicating that cells 
not normally expressing hypertrophic marker genes do so upon 
release from their matrix (O'Keefe et al . , J. Bone Mineral 
Res . , 9: 1713-1518 (1994) ("O'Keefe"), which is hereby 
incorporated by reference) . Band 17 expression increases 
during enzymatic release from the matrix (Figure 4A) . 
However, Band 17 expression decreased significantly during 
the first 24 hours of growth in culture, in contrast to type 
X expression (O'Keefe, which is hereby incorporated by 
reference) . Furthermore, Band 17 expression remained at low 
levels (Figure 4B) . During this same period, type X 
collagen remained elevated and constant, and type II 
collagen decreased (Figure 4C) . In a separate experiment 
using identical isolation and culturing conditions, alkaline 
phosphatase activity was shown to increase, then remain 
steady, while cellular proliferation decreased. Thus, many 
parameters of the hypertrophic phenotype are consistently 
found in these cells throughout the culture period while 
Band 17 expression is found only in the initial stages of 
culturing . 

In summary, four independent aspects of Band 17 
gene expression support the hypothesis that Band 17 is 
involved in the commitment of proliferating chondrocytes to 
hypertrophy. Band 17 expression: 1) is specific to growth 
plate chondrocytes; 2) is restricted to the lower 
prolif erative/upper hypertrophic zone of the growth plate; 
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3) is increased concomitantly with induction of hypertrophy 
in vitro; and 4) is independently regulated compared to 
hypertrophic marker genes . This pattern of expression 
places Band 17 in a limited group of genes that are 
5 expressed differentially within the growth plate. 

Example 6 -- Alternative Splicing of Band 17 

Figure 5 summarizes the known intron/exon 
structure of the Band 17 locus compiled from four sets of 
10 data: 1) probing RNA blots with Band 17 cDNAs (as detailed 
above) , 2) probing a genomic Southern Blots with Band 17 
cDNAs , 3) cloning and sequence analysis of overlapping cDNAs 
and 4) cloning and sequence analysis of a 12.5 kb genomic 
g f ragment . 

15 The splice sites have been identified by 

comparison of Band 17 cDNAs with genomic DNA sequence. The 
2.2, 5.0, and 6.2 kb transcripts share at least three exons 
at the 5' end of the mRNA, but the 6.2 kb transcript 
diverges from the 2.2 and 5.0 kb transcripts beyond the 3' 

20 end of exon C. The 5.0 and 2.2 kb transcripts have 

approximately 1 kb of common sequence at the 5' end of exon 
Dh The 3' end of 2.2 kb transcript is approximately ac the 
Ncol site in exon D (Figure 5), as cDNAs from exon D 3' to 
that site do not detect the shorter transcript. This 

25 results in exon D-short (D s , Figure 5) . The remainder of 
exon D is approximately 3 kb long and contains no open 
reading frames. The 3' end of exon D has been approximately 
mapped by an AATAAA consensus termination sequence and by 
genomic DNA fragments downstream of this site that do not 

30 detect the 5.0 kb transcript. 

The multiple transcripts detected with the Band 17 
cDNA probes could arise from duplicated, highly similar 
genes. This possibility was investigated by probing a 
genomic Southern Blot with a cDNA that spans a Bgl II site 

35 within exon D (Figure 6, probe V) . Sequence and restriction 
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analysis of cloned genomic DNA predicts that probe V should 
detect Bgl II fragments of 1.7 and 3.8 kb, and single EcoRI 
and Xba I fragments of 5.3 and 8.0 kb. Figure 6 
demonstrates that these fragments are the only ones detected 
by probe I. Similarly, probe IV, which is specific for the 
6.2 kb transcript, also detects single EcoRI, Bgl II, and 
Xba I fragments on a genomic Southern (Figure 6) that are 
distinct from those spanning exons B-D. 

Analysis of Band 17 cDNAs provides corroboration 
that the three Band 17 transcripts are derived from single 
gene. Multiple cDNA sequences that diverge at the splice 
point between the 2.2 and 5.0 transcripts (exons C/D) , and 
6.2 kb transcript (exons C/E) have been obtained. Sequence 
analyses of the independent cDNAs representing the three 
transcripts do not indicate variability that would suggest 
an additional gene as a source for one of the fragments. 
The 2.2 and 5.0 kb cDNAs overlap for approximately the first 
1000 bp of the exon D (Figure 5), and the 2.2, 5.0, and 6.2 
kb transcripts overlap for all of the exons 5' to the 
alternative splice site, which is at lease 600 bp. Were the 
different transcripts arising from a second locus, perfect 
^homology would be highly unlikely. 

Example 7 -- Proteins Encoded by Band 17 

Figure 7 displays the Band 17 cDNA with the 
predicted translation of the only significant open reading 
frame in the cDNA sequence. The predicted amino acid 
sequence is for the cDNA that corresponds to the 6.2 kb 
mRNA. The alternative splice site for the 6.2 and 5.0 kb 
transcripts is at position 587. In the 2.2 and 5.0 kb 
transcripts the sequence added by exon D begins 5'-TTGA-3', 
the last three nucleotides encoding a termination codon . 
Thus, the protein translated from the 2.2 and 5.0 kb 
transcripts is predicted to be 131 amino acids shorter at 
the Oterminal than the protein from the 6.2 kb transcript. 
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The program MOTIFS of the Wisconsin Computer group 
sequence analysis software matched the C-terminal of the 
longer protein, Ala-Asp-Glu-Leu-COOH, to a putative 
consensus sequence that targets and retains proteins to the 
5 luminal space of the endoplasmic reticulum (Munro et al . , 
Cell, 48:899-907 (1987), which is hereby incorporated by 
reference) . A number of different luminal proteins in 
vertebrates end in the similar Lys/His-Asp-Glu-Leu . The 
initial basic residue of this signalling tetrapeptide 

10 sequence is conserved in vertebrates, but an alanine at the 
N-terminal position can be found in a yeast protein. 
Furthermore, a number of luminal proteins, such as rat, 
chick, and human protein disulphide isomerase (Edman et al . , 
.? Nature, 317:267-270 (1985); Geetha-Habib et al . , Cell , 

15 J 54:1053-1060 (1988); and Cheng et al . , J. Biol. Chem . . 

262:11221-11227 (1987), which are hereby incorporated by 
reference), chick and mouse Hsp4.7 (Hirayoshi et al . , Mol . 
Cell . Biol . . 11:4036-4044 (1991) and Takechi et al . , Eur . J . 
Biochem. , 206:323-329 (1992), which are hereby incorporated 

20 by reference) and chick GRP94 (Kulomaa et al . , Biochemistry . 
25:6244-6251 (1986), which is hereby incorporated by 
reference) , have a bulky hydrophobic group as methionine or 
valine preceding the lysine, as does Band 17. 

25 Example 8 -- Band 17 Homology with a Human cDNA 

Comparison of the Band 17 sequence with NCBI data 
bands detected homology with two overlapping uncharacterized 
cDNA clones from infant human brain tissue (Figure 8A) . 
This homology is found within the protein coding sequence of 

3 0 Band 17 (Figure 8B) and extends into the sequences specific 
to the 6.2 kb cDNA. Translation of the two sequence 
predicts a high level of homology (70% identity) between the 
human and chicken genes. As yet there are no other 
significant homologies between these two sequences and any 

3 5 other nucleotide or amino acid sequences in the data banks. 
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However, the tight conservation between the chicken and 
human primary structure suggests that the function of the 
two proteins has been conserved. 

Although the invention has been described in 
detail for the purpose of illustration, it is understood 
that such detail is solely for that purpose, and variations 
can be made therein by those skilled in the art without 
departing from the spirit and scope of the invention which 
is defined by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT : University of Rochester 
(ii) TITLE OF INVENTION: CHONDROCYTE PROTEINS 
(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Clinton . Square 

(B) STREET: P.O. Box 1051 

(C) CITY: Rochester 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 14603 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: U.S. Provisional 
Serial No. 60/021,672 

(B) FILING DATE: July 5, 1996 

(C) CLASSIFICATION: 

( vi i i ) ATTORNEY/ AGENT INFORMATION : 
' ^ (A) NAME: Rogalskyj , Peter 

(B) REGISTRATION NUMBER: 3 8,601 

(C) REFERENCE / DOCKET NUMBER: 176/60092 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (716) 263-1634 

(B) TELEFAX: (716) 263-1600 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GATCACTGCG ACAAGTTCGT GGCCTTCGTG GAGGACAACG ACACAGCCAT GTACCAAGTG 6 0 

AACGCCTTCA AAGAGGGCCC GGAGATGAGG AAGGTGTTGG AGAAGGTGGC GAGTGCCCTG 12 0 

TGTCTGCCGG CCAGCGAGCT GAACGCAGGT AACAGAGCGG CCCCGGGTAC GCTGCGCTCA 180 

GTGTGATGCG GGATGTGCTG CAGTTATGCA GAGTTCCTGT CTAAAATACA AGCTGAACCA 240 

GATGCAGTCA TGCAGGGTTC GTGTGGGGCT GCAGTAGTGC GTGCTTGTTA GTCAACAGAA 3 00 

AGAAAACACC TTTGGGAGTA TCTTTCTTGG AGACGAGTGG AAGTATCAGC TGTACCTTTG 360 

TTTTAAGGGC TCAGCTTTAC TTTTG CTTTG AGTTATGAGT GTGTTACCTT TTAATTCTCC 420 

TTCTGTAAAA TGTTGCAATT CAAGCATGCA GATAGTTGAA GGGAAGGGAG GATGTGTCTG 4 80 

CGTTGTACCT TCGCTTGTCT ACAGGGAGCA CATTTCCCAT GCTCAGGAAG CCCCCAGAAA 54 0 

TAAGCACTGC TGTCATTTCC AGCATTCCCC CAAAGATGTG ATCCTAAAAC CACGTCACGC 600 

TGCAGCTCAA ACCCAGCCAG C AG C ATACAG GTTAAGCATG GCAGCCTGAG ACTGCTCCAC 66 0 

AGTGAGCCGG CACGCCTCCA CCTGCCCCTC TTCTGCCTTT TGTGATAGTA AGGCTATCCC 720 

AGCAGTGGGA CTATCACAGG TGCATCAGTT CAGTGTGGAA TGTGTGGTTT TGTTTCCCTG 78 0 

AGGTTTGCAT TCTGCACGAT AACTCTATTG GAAACTTTGT TGCTTGGCAT TTGGGCTGGT 84 0 

GATTGTTTTC AACCCTAAAT TGTAGTTACT CGTACAAAAC CATGACAAGG GGAAAGTTGG 900 

GAGAAAGTTG CTAGTTCTGT GGTGGTGGTT TTATCCCTTG CTCCTTTCTT GGATCTATTG 96 0 

CAGATCTCGT TCAAGTGGCT TTCCTCACTT GCTCGTATGA GTTGGCTATA AAAAATGTGA 10 2 0 

CCTCCCCGTG GTGTTCGCTC TTCAGTGAAG AAGATGCTAA GGTAGGTGCT AAATGCAGAG 10 8 0 

GGCAGAGAGA TTTGAGAAGC CTTCAAAACA TGCCTCACTG TTTGGATGTT GTTTTGTGGG 114 0 

CAGTTGTAAG TTCTGTGCCC GTCCTTCTTC AACCTTCATT AGGTTTGGTG CTCCATTAGC 12 0 0 

GCTGCATTGG TCTCCAAAGA GCTGTGGGTT AATCAAG C AG TAGGACTGAA ATACCTTCTG 12 6 0 

CATTCAGACT TAAATATTGG CAGTGTCTTA ATTTGTCCTG ACTAAAATGA TCTTTTCCAT 13 2 0 

TGCACACTTA ATTCATGTAA TGCTTTTTTC TTTCTGTAAC ACCTGAAATG CTCTGGACAA 13 8 0 

CTTTGTTTTA CATGTATTAT TTTTATATGA TAAAATGTCT TGATTTTAGA GGACAGCAAA 144 0 

TAAGGTCTTT TAGGTCCTCT GTGACTTCTT TTCTGAGGCC CAACTGGTCT CTAATTCCTG 1500 

TTAATAAAAC TAGTAGAACC TGGATAAATA TGACTTGCTT TGGATTACTC TTTGGAGGGA 1560 

TTGAGAGATT TGGGGATTAA GAATGATGCC ATTTATTTGG CACTGCAAAA CACGTTTAGC 162 0 

AATGCCCCTG CAGAGGCTCC TAAAGGAAGC TTAGCAGCCC TGCCAAAGAG AAAAACCCTG 16 8 0 

GAGTCAGGAG GAAGCGGTCT CCTCTCAAAG AAGAGGAGGG TCAGCAGGAA TTTGTGCTGT 174 0 

TTCCTTCTAA TAGCTTAGTG AGAGAGGAAA GCTTGCTGAT TAAGCGGTTA CTTGGCACGT 18 00 

TAAGAATATG GGGTGTTTGA GCAGCTCTGC TGGAAGACTC TACAAGGTTG AATTGCCCAG 1860 
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CAGTGCAGTG 


GCAGTTGGTG 


TTCAGTG TGA 


AATTACGTGC 


ATGGAGTAAG 


AGGTTAAAGC 


1920 


TCCATCAGTG 


AGGTGGTGGG 


CTCTCAGATC 


CCTTTTTATT 


ATTTATTTAT 


TTATTTTCAC 


1980 


TGTATG CAAT 


AGTAAAAACT 


TGTAAACTGT 


GTTAACTTTA 


GGTACTGGAG 


TACCTGAATG 


204Q 


ACCTGAAGCA 


ATACTGGAAG 


AGAGGATATG 


G CTATG AC AT 


CAATAGTCGC 


TCCAGCTGCA 


2100 


TTTTATTCCA 


GGATATCTTC 


CAGCAGTTGG 


ACAAAGCAGT 


GGATGAGAGC 


AGAAGGTAAA 


2160 


TTAAAAAAAA 


AAAAAGGGGG 


GGGGGGGGGG 


GAAGCTTTTG 


TGTTGACTGA 


CTGCAAGCTT 


2220 


TCTGTGGTTA 


ATCCTGAGTT 


GGATTTGAGT 


AGCAGTTAAA 


CACTTCAGAC 


ACAAGAATGC 


2280 


TAGGAGAAGT 


TTGGTTAGGA 


GAACTTGTGA 


TTAGAGAGAA 


CAAAATCCTT 


AATAGGATCG 


2340 


TTACTGTAGA 


GTGCAAATAG 


GCTTGAGGTT 


TTATTTTTCC 


CATTGATGCT 


TTTGTGC CCA 


2400 


GTGGATTTAT 


TTCCATCTTT 


TAACTTACTG 


ATCT GCACAG 


GCCTTCAAAG 


GACAGCCAGT. 


2460 


TACTGTGTCT 


GACAGTGGTG 


GTTTTTTCCT 


GCTGAACAAT 


GAATTTTTTG 


TTTAAAATGT 


2520 


CTTTGTTAAA 


AAGCATTTGT 


GGTGAAAGTG 


GAAAGGCTGT 


AGGTTAAAAA 


AAGCAATATG 


2580 


ATCGATTCTG 


CTTTCTGGTT 


ACTTAAACAC 


TTCAGCATGA 


AAGTCTTGTT 


TTCTTTC CAT 


2640 


GTGTGTTTGA 


CATCTCTTGC 


ACTATTAAAG 


CTTTCTGAGC 


TTTAAAGCTT 


CAGGCTGAAG 


2700 


GTGCTGAAAT 


GCAATTACAA 


AAGAATAATT 


ATTTCAAGTG 


AATCCAAACA 


CTCAGTGACC 


2760 


CTAGATGAGA 


ACTGCCTGTT 


GCAGAATCCA 


CCAAGCCTGA 


ACTGTAACAG 


CAAACCAGCC 


2820 


TTGTCATGCC 


TGCTTCTTTG 


TAACTGCAGA 


AAGACAAACT 


TAGGCAGTAT 


ACTCGGTCCC 


2880 


TGCACAAACA 


GGAGAAAGGT 


ACTTGAGCCC 


TGAGGCTGTT 


GTAAAAGCCT 


TGGTTTGTTG 


2940 


TACGAACATG 


AGGCCAGTAA 


TTTAGCCAGC 


CAGCCACTCT 


CTTAGATATT 


TACTTTCGCA 


3000 


TCCTTACTCA 


TCTGCAGCAA 


AACTG C C CAT 


TGGGAGCAAT 


GCTGTAGGTG 


TAGGAAGTTG 


3060 


TTAGACCTCA 


CATGTATCTG 


TTAGCAGACA 


CAAAGATAGC 


ACAAGCAAGA 


GTCTGCAGAG 


312 0 


GAGGGTGGTC 


TGATGAAGTG 


GTTTGTGTTC 


AGCTAGTTCC 


ATGGTTTGGC 


AAGTCATTTT 


3180 


GTGTCAGAGA 


AGGAAGAACA 


GCAGTGGTAC 


TCCTTCCAGG 


AACTCTTACA. 


GCCCTCAAAA 


3240 


TTGCCTTTAA 


CGTGCCTTGG 


AGGTACCTAT 


GCTTCCTTAA 


AAGCTAAAGA 


CAAGATGCCT 


3300 


GTGTTCTTGT 


GTGTATTGTT 


TACTCCTATC 


AGCTGCTATC 


AGTCGGCAGC 


GGTGATCTGT 


3360 


TGTAACCTAG 


AGAAAACAGT 


ATAGAAAACA 


AAGGCTTTAG 


TTACAGGTTT 


GGGTGTTTAT 


3420 


GTCACAAGAT 


TAGCTGTATT 


TGCTTTCATG 


TGCCAGTAAT 


AAAATTTTTG 


AGAGCTGCGT 


3480 


TAGGCTTAAA 


AACAGTGCAT 


GCATATGGGA 


ATAATTTACA 


ACCTGCATGA 


ATGTTGTTTT 


3540 


TCTAACAGAG 


GAATTACAAA 


TTCATAGCTT 


AGTGATCAGC 


CATGTGAATC 


AGTAC CTGAG 


3600 


CAGGTAAGCG 


CACAAATGTT 


TACAAAAGCA 


CACAAAATCA 


AGGAGGTGAT 


AACAAGATTG 


3660 


TGTAAACATT 


GTGCCTTTAA 


ATGGTTCGTT 


GGAATCAATG 


TATGAGT AG C 


GTAAGGTGAC 


3 72 0 


CAAGTTCAGC 


TTTGATATTG 


ATATAGAAAA 


AGTAGTTGTA 


TGTGATGGGT 


GTACTTACAT 


3780 



BNSDOCID: <WO 9801468A1_I_> 



WO 98/01468 



PCT/US97/11311 



- 53 - 

TGCTAGCATC CTTGGGGTTC TAGTTCTAAA TTTAGGGTAC TGAAGTAGGT CAAAAATTAT 384 

TTAGTGTTTC AGGAACGAAA GCTGAAGTCA CTGATACTTG AAGCTATATG TGTGTATTTT 3 900 

TTTTTACTTG ATAACATGTA AGAAAGCACT TTATTTTCCC CTGTCAGTTG ACAGATTGAA 3 960 

AATAGAGGTA GCCTTGCAAT TTTGGATCAG AGGAATGATC TATCAAATTG TGAAGTCTTC 4 02 0 

CTCCTTGGAA GAAAAGCTTC AAAAGCTGCC CTGGCACTAC CCTGGGATAC AGCCTCCAGA 408 0 

GGTCCCTTCC CACCTCAAGC ATTCTGTAAC GCCAATCACT TCTTACAAAG AGGACTGCGA 414 0 

AGAAGTTGTT CATCTAGATT TTTGCTCACT GAGGATCTGA GTTAAATATC AACAGTGATA 4 2 00 

GAACTGACTG TTAAGTCAGT TGAAGCAGAA TTCTCAGTCA GTTGGCTTTT TTGTTGTGCT 426 0 

TCAGTGCTGG ATGCAGAGAT GCTGTGTGTT AAGCCCTCTT CATTTTGCTA TGAACAGGCT 4320 

AGAACTTGTT GTAAGCTAGT TGTAAGCATG AAACCAACAT AGCACCGAGG ACTAATTGTG - 4380 

AAGGAAAGGT GGGCAGAAGG AAGTGGCTGT TGATAGCAAA CTCTCTGCAG CAAGCCTGGA 444 0 

CATTGTGCTG CTAAATCATT CTGGTTTTTG GAAATCTAAG GGCTGTCAGA GCTGTTGATC 45 0 0 

CCTCTCATTT TGAGAGTGGT GGAGTCAAAG CTGTGGTTAT GCTAGATTGC CCTTTAAATA 4 56 0 

AATCTCTACT GTATCCTTTC TTCAGCATTC TGGGAAGCTA AATAAAAAAT GCATGAGGCC 462 0 

ACAGGTCATT TACATCCAAC TGTGAAGAGA TTGACAAGCA CACTGCTGTG ATTGCTTCCA 46 8 0 

TATATGCTGT GTCTGCTTCT GCGAAGATAG AAAATATAAA CAGAATGAGG AGACGAAGAG 4 74 0 

CAGATTAAAA GTGAGCAGAC AAGCAGAGCA AAACCCCTCT GCCCTTCTGA AGGAAAAAAA 4 8 00 

AATAACTTCT TAATGTAGCT TGTCTCATAT AAGGAGAATA ATTAGATCTA TTTGCTTTTA 48 60 

GTGTATTTAT TCTATGAGCA GGGAAAGCCT TTAAATCCTT AAGTGCTACT TAGAAAATAG 4 92 0 

t CTTTAATTCT TAACTGTTTA TTAAGTCTGT AAGTTTAATA ATGATAAAGC TATAATTGAC 4 98 0 

AAAATCCACA TCTGTACTTC CAGTTT ATT G ACAGCTCATT CAGCAGCCCC TAAATTTCTT 504 0 

GGGAAGAGCA GGTGTTGGAG GCAGAGCAGT AAAAGATTGA GATGATCTCA TCCTGTCTTA 5100 

GAGCTTTGGC CATGGAATCA GAATCACAGA ATATCCCAAG TTTGGAGGGA TCTGTAAGGA 516 0 

TCATCGAGTC CAATTGTGAT GTTTAAAACA TGTCATTTAG CAATGAGGTG TTGAGGAGAA 522 0 

GCAGTGAAGG CCAGCAGATG GATGTCTGTC AGGATGGTCC CTCCTGGTCA CTGCTAGTCC 52 8 0 

CTTCTTGTTT GAAAGGAAAC ACCCAAAATC TCCACTGGTT AAAACTTGTC ACTAGAACCC 53 4 0 

ATCTAGGAGA GTCCTGAGCT TCTGCTGATA AGCTGTAAAA TCAATTGTGA TCAAACATGA 54 0 0 

TCACAAGTGA GACAATTCTA GGGATGCCTG GAGGGAAATG ACCCACAGAG GCCAAAATAC 546 0 

AGGTATACAA CTGGGGTTTT CTACCTAAAC TGAGGTGCTG AGAGTTTGAA CAGGCACCCT 5 52 0 

ACCCTATAAC ACCCTGTTGC TCACCATGGA TGGTGTTGCA ATCCTTTTGA ATTAAGCATG 558 0 

TGGCTCCATG AGGCTGGCAC CAGTAAGCCA GGACCTCCAA ATGACAGAGT ACAACTGATG 5 64 0 

GAATCACTGA GGTTTGAAGA CACCTCTAAG AC CATTGAG C CCAACCAGCT CATCCTTGAG 5 700 
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CTCCTGTGGC TGCCCTCAGA GCTGCTACAC CCTCATCTCT GTTCATTACC AGGTTGTGAT 5 76 0 

TATTTGGGAG GAAGCTTGCC TCCTCCTTCC AG C CAGGAG A GCCCTCTCAG AG C ATGGAAG 58 2 0 

CAATTAGTAT TTTCAGTCAA TCCAATATAT GCTGTCAGTC TGCAAATAGC CAACTAAACA 588 0 

ACATGCCAGC GTGCTGCCAT GCTGTCAGTC TGCAAATAGC CAACTAAACA ACTAGCCAGC 594 0 

GTGCTGCCAG TCCCCTTCTA CGGACTGCTG GTCTCCCAGG GATAACTTCA GGAAAGCTGT 6 00 0 

TTCATTTGGG AAAGTTATTC CATGGCATCT GCTGCAGGAC ATACAGCTGA GAGGGAGAAG 606 0 

TCCTCCCAAG CACAGGAGAA CATCTCCCAT CCTATGGAAG CACCGAATTG TG CAGGAGAT 612 0 

AACCAACTGA AAAACACAAA CTTACATCCT AACCCAGGGG ATCATCTCCA GTAGTCCAAT 618 0 

TTTTGATAGA CAAATGTAAG TACAAATTTA TGTCTGGTAA AAGCCAAGAA AATGGGTCAA 624 0 

GCAAAATTTA TCCAAAGCAC ATTGTCTGAA GAATGATGTG ATATATTCAG CAAAACCGAT 6 300 

GTCAAGAAAT TGACAGAAGT TTAAAATAAT AG CAGATGAC TTCAGAGATT TTCAGTGATT 63 6 0 

TCTGGAATAT ATTATAAAAG CAAAAATATT TGCACTGATC TGTGATATTT AAAGATGTAA 642 0 

d CTGGGAAGAA TCACTGTTCA GATGTGTTGT TGTTACCCCA GACAGAAGCA GGTAGTGAGT 64 8 0 

TTGTGCACAT GTGTGGAGAG TGGAGACCCT GGCAAAAAAT GGAGATCTGG CAAAATTCAA 6 54 0 

j AGCTGGGTGA GCAGCCTGCT TACCCTGTGT GTTCTAAAGT GGGGGCTGAA GGCATCTCAA 66 0 0 

ACTTACTGCC TTCTGCAAAA CGAGCATGTA ACCCCATCCC GCAACGTCAG GTGGCAGTAT 666 0 

TAAAG CACTG AAGGCTTGAG TACAGTCTCT ATTAGGCAAC CTGGTTCACT TAAAAGTAGG 6 72 0 

TGGAAATCTA CCACCACCAA TGTAGGAGAG CACCTTGTGT CTCTTCATCT GGGGAGTGGA 678 0 

GATACAACTA ACAATCCTTC ATCTAGGGAG GGAGACTTAT GTGGGGACCT GAAGCAATTT 6 84 0 

GAGAGTACAG CTGAGAACAA GAAACCATAC AAAAGGAAAA TATGCATATT TTTTAGCCGT 6 90 0 

AGAAAATACT TGGTTGTGTA TGCATGTGTT ATTATGACTA TATAGTGTTA TTACTATATC 6 96 0 

TTTAATGATA TAGTACAGTT CTGTATTTAA TCTGTTGCCC CACCTGCAGC TGTTAATTGC 7 02 0 

TCAGAAAATG AGCCTCTGTG GTGG CAAAAT GTTGTCTTAT TTATCCGTGT TTTAACACTG 70 8 0 

ATATATATCT CTGGTTTGTT CTGATACTAC AGGAAGAATG ATTTTATTTC CAGAATCTTA 714 0 

CTGTTGCTCC AAGTTCTCCT TTTTTTTTAA AAATGAAAAG TTTAGTTTGG GCTATCCAGT 72 0 0 

AGCAGCTGTT GGAGCATTTG TGCTCCAGCA AGGAGTTATG GTGTCTGGCT TTGTGTTTCT 72 6 0 

GTTCTAGGCT TGTTGGTAGA GAATGGCATT GCCAGCTCTG CATTTTATAG CATATTTCAA 73 2 0 

ATATTTATAT TTAGCAGTTT GCCC CGTTTT CATTCCTTGT TACAGCTCAA ATAAAATGAG 73 8 0 

AGCTTTTACT TGTAACCCTT TTTCTTCCAT GAAGCTTTTA TTGACCCAGC AATCTGATTT 74 4 0 

CTGATTATTT GCCTAATTAG TTGCCTTATT AAAGCT CACT CTTCTTTCTT CTGGAAAAAG 750 0 

TACCTTCTGG AATAATGTCG GCCCTTAAGA AAATGATGAA AATTACTGAA ATTCTCAAGA 75 6 0 

TTTTAACTAT GAGACCATTA GAGAGTTGGT ATTTGAGTTA CAACTTTGAT GTCTCAGATG 76 20 
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TGAATGTTTG 


GCGTCTCCAT 


TCTTCTGCAC 


CTTCAGTAGC 


AATAAAACAT 


TAATGTCCTG 


7680 


TAAAGGTTAA 


TTCCTTTTCT 


TTGAGACCTT 


ACCACTGTCA 


AATAGGTTCT 


TCCAAGACCA 


7740 


CATTCCTCTG 


TGTCTCCTTG 


CCTGTCTGTA 


AGGT GAT AC A 


GTGATAACGT 


GTCTGGGGAG 


7800 


AGTTTGAGTG 


CCACAACTCT 


CCCATAAAAA 


GTTTCTTATT 


TAGAAGAAAA 


AGGAAATAAT 


7860 


ATTATAGGAG 


TGGAGTAAAG 


TTAAACCAGG 


TGAGTTGTGC 


TAAAATGG C A 


TACTTGGGAA 


7920 


GTTGTCCAAG 


T C C AAAT AAA 


GAGCTTTATT 


TTTGTGATAA 


GGAAAGGATT 


AAATTCTTCT 


7980 


CATGTCTGTC 


CGTTATGGAT 


AGCCAACAAT 


CAGACCATGC 


AACTATATGG 


CAAAGAAGCC 


8 04 0 


AA 1 <j<j(j(j I AA 


TACTCTTCTC 


TGAACTGTTG 


GTTTTTTTCC 


ATACTGGAAC 


CTTACAGAAA 


8100 


ATGTCCCTAC 


TCTTCATTAT 


GTGGGCAAAA 


CTGACAGGTA 


GCGATGTGCT 


TGTACTGCTG 


8160 


CACTTGGCGT 


TGTGCTGCTA 


TGGAAGAATC 


TCGAAAGGCT 


GCTCTGCATT 


TGATTGAAGA 


8220 


GTTAGTGTCC 


AATTTCCCAC 


AGTTGTGGTA 


TTTGGAGGAA 


GTTTTAACAG 


TGGTACATAG 


8280 


AGGAGCAATA 


GATGAGTGTC 


TCTCTGCCTT 


GGAAGAAGCT 


T 




8321 



(2} INFORMATION FOR- SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 502 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

'tGGCACGAAGG GAGGCGAGAG GATCCCGGAG CAGCTGGAGC AGGCGGCCGC GCCCGTCCTC 60 

CTCTTCCTGC AGCTGCCGCC ATGGCGCCGT GCCGCGCTGC CTGTCGTCTG CCGCTTCTGG 12 0 

TAGCGGTGGC GAGCGCCGGG CTGGGCGGCT ACTTCGGCAC CAAGTCCCGC TACGAGGAGG 180 

TGAACCCGCA CCTGGCGGAG GACCCGCTGT CCCTCGGGCC GCACGCCGCC GCCGCCCGGC 240 

TGCCCGCCGC CTGCGCCCCG CTGCAGCTCC GCCGCGTCGT CCGCCACGGC ACCCGCTACC 3 00 

CCACGGCCGG GCAAATCCGC CGCCTGGCCG AGCTGCACGG CCGCCTCCGC CGCGCCGCCG 3 60 

CCCCGTCCTG CCCCGCCGCC GCCGCGCTGG CCGCCTGGCC GATGTGGTAC GAGGAGAGCC 420 

TCGACGGGCG GCTGGCGCCG CGGGGCCGCC GCGACATGGA ACACCTGGCG CGCCGCCTGG 480 

CCGCCCGCTT CCCCGCGCTC TTCGCCGCCC GCCGCCGCCT GGCGCTGGCC AGCAGCTCCA 54 0 

AGCACCGCTG CCTGCAGAGC GGCGCGGCCT TCCGGCGCGG CCTCGGGCCC TCCCTCAGCC 6 00 

TCGGCGCCGA CGAGACGGAG ATCGAAGTGA ACGACGCGCT GATGAGGTTT TTTGATCACT 66 0 

GCGACAAGTT CGTGGCCTTC GTGGAGGACA ACGACACAGC CATGTACCAA GTGAACGCCT 72 0 

TCAAAGAGGG CCCGGAGATG AGGAAGGTGT TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC 78 0 
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CGGCCAGCGA GCTGAACGCA GATCTCGTTC AAGTGGCTTT CCTCACTTGC TCGTATGAGT 8 40 

TGGCTATAAA AAATGTGACC TCCCCGTGGT GTTCGCTCTT CAGTGAAGAA GATGCTAAGG 9 00 

TACTGGAGTA CCTGAATGAC CTGAAGCAAT ACTGGAAGAG AGGATATGGC TATGACATCA 9 60 

ATAGTCGCTC CAGCTGCATT TTATTCCAGG ATATCTTCCA GCAGTTGGAC AAAGCAGTGG 10 20 

ATGAGAGCAG AAGTTGACAG ATTGAAAATA GAGGTAGCCT TGCAATTTTG GATCAGAGGA 10 80 

ATGATCTATC AAATTGTGAA GTCTTCCTCC TTGGAAGAAA AGCTTCAAAA GCTGCCCTGG 114 0 

CACTACCCTG GGATACAGCC TCCAGAGGTC CCTTCCCACC TCAAGCATTC TGTAACGCCA 12 00 

ATCACTTCTT ACAAAGAGGA CTGCGAAGAA GTTGTTCATC TAGATTTTTG CTCACTGAGG 12 SO 

ATCTGAGTTA AATATCAACA GTGATAGAAC TGACTGTTAA GTCAGTTGAA GCAGAATTCT 13 20 

CAGTCAGTTG GCTTTTTTGT TGTGCTTCAG TGCTGGATGC AGAGATGCTG TGTGTTAAGC 13 80 

CCTCTTCATT TTGCTATGAA CAGGCTAGAA CTTGTTGTAA GCTAGTTGTA AGCATGAAAC 14 4 0 

CAACATAGCA CCGAGGACTA ATTGTGAAGG AAAGGTGGGC AG AAGGAAG T GGCTG TTG AT 15 00 

AGCAAACTCT CTGCAGCAAG CCTGGACATT GTGCTGCTAA ATCATTCTGG TTTTTGGAAA 15 60 

TCTAAGGGCT GTCAGAGCTG TTGATCCCTC TCATTTTGAG AGTGGTGGAG TCAAAGCTGT 162 0 

GGTTATGCTA GATTGCCCTT TAAATAAATC TCTACTGTAT CCTTTCTTCA GCATTCTGGG 16 8 0 

AAGCTAAATA AAAAATG CAT GAGGCCACAG GTCATTTACA TCCAACTGTG AAGAGATTGA 174 0 

CAAGCACACT GCTGTGATTG CTTCCATATA TGCTGTGTCT GCTTCTGCGA AGATAGAAAA IS 00 

TATAAACAGA ATGAGGAGAC GAAGAGCAGA TTAAAAG TG A GCAGACAAGC AGAGCAAAAC 18 60 

CCCTCTGCCC TTCTGAAGGA AAAAAAAATA ACTTCTTAAT GTAGCTTGTC TCATATAAGG 1920 

AGAATAATTA GATCTATTTG CTTTTAGTGT ATTTATT C T A TGAGCAGGGA AAGCCTTTAA 19 80 

ATCCTTAAGT GCTACTTAGA AAATAGCTTT AATTCTTAAC TGTTTATTAA GTCTGTAAGT 2 04 0 

TTAATAATGA TAAAGCTATA ATTGACAAAA TCCACATCTG TACTTCCAGT TTATTGACAG 2100 

CTCATTCAGC AGCCCCTAAA TTTCTTGGGA AGAGCAGGTG TTGGAGGCAG AGCAGTAAAA 2160 

GATTGAGATG ATCTCATCCT GTCTTAGAGC TTTGGCCATG GAATCAGAAT CACAGAATAT 22 2 0 

CCCAAGTTTG GAGGGATCTG TAAGGATCAT CGAGTCCAAT TGTGATGTTT AAAACATGTC 22 80 

ATTTAGCAAT GAGGTGTTGA GGAGAAGCAG TGAAGGCCAG CAGATGGATG TCTGTCAGGA 2 34 0 

TGGTCCCTCC TGGTCACTGC TAGTCCCTTC TTGTTTGAAA GGAAACACCC AAAATCTCCA 24 00 

CTGGTTAAAA CTTGTCACTA GAACCCATCT AGGAGAGTCC TGAGCTTCTG CTGATAAGCT 24 6 0 

GTAAAATCAA TTGTGATCAA ACATGATCAC AAGTGAGACA ATTCTAGGGA TGCCTGGAGG 2 520 

GAAATGACCC ACAGAGG CCA AAATACAGGT ATACAACTGG GGTTTTCTAC CTAAACTGAG 2 580 

GTGCTGAGAG TTTGAACAGG CACCCTACCC T AT AAC AC C C TGTTGCTCAC CATGGATGGT 2 64 0 

GTTGCAATCC TTTTGAATTA AGCATGTGGC TCCATGAGGC TGGCACCAGT AAGCCAG G AC 2 7 00 
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CTCCAAATGA C AG AG T AC AA CTGATGGAAT CACTGAGGTT TGAAGACACC TCTAAGACCA 276 0 

TTGAGCCCAA CCAGCTCATC CTTGAGCTCC TGTGGCTGCC CTCAGAGCTG CTACACCCTC 28 2 0 

ATCTCTGTTC ATTACCAGGT TGTGATTATT TGGGAGGAAG CTTGCCTCCT CCTTCCAGCC 28 8 0 

AGGAGAGCCC TCTCAGAGCA TGGAAGCAAT TAGTATTTTC AGTCAATCCA ATATATGCTG 2 940 

TCAGTCTGCA AATAGCCAAC TAAACAACAT GCCAGCGTGC TGCCATGCTG TCAGTCTGCA 3 0 00 

AATAGCCAAC TAAAC AACT A GCCAGCGTGC TGCCAGTCCC CTTCTACGGA CTGCTGGTCT 3 06 0 

CCCAGGGATA ACTTCAGGAA AGCTGTTTCA TTTGGGAAAG TTATTCCATG GCATCTGCTG 3120 

CAGGACATAC AGCTGAGAGG GAGAAGTCCT CCCAAGCACA GGAGAACATC TCCCATCCTA 318 0 

TGGAAGCACC GAATTGTGCA GGAGATAACC AACTGAAAAA CACAAACTTA CATCCTAACC 324 0 

CAGGGGATCA TCTCCAGTAG TCCAATTTTT GATAGACAAA TGTAAGTACA AATTTATGTC 33 00 

TGGTAAAAGC CAAGAAAATG GGTCAAGCAA AATTTATCCA AAGCACATTG TCTGAAGAAT 3 3 60 

GATGTGATAT ATTCAGCAAA ACCGATGTCA AGAAATTGAC AGAAGTTTAA AATAATAGCA 3420 

GATGACTTCA GAGATTTTCA GTGATTTCTG GAATATATTA TAAAAGCAAA AATATTTGCA 34 8 0 

CTGATCTGTG AT ATT TAAAG ATGTAACTGG GAAGAATCAC TGTTCAGATG TGTTGTTGTT 3 54 0 

ACCCCAGACA GAAGCAGGTA GTGAGTTTGT GCACATGTGT GGAGAGTGGA GACCCTGGCA 3 6 00 

AAAAATGGAG ATCTGGCAAA ATTCAAAGCT GGGTGAGCAG CCTGCTTACC CTGTGTGTTC 36 SO 

TAAAGTGGGG GCTGAAGGCA TCTCAAACTT ACTGCCTTCT GCAAAACGAG CATGTAACCC 372 0 

CATCCCGCAA CGTCAGGTGG CAGTATTAAA GCACTGAAGG CTTGAGTACA GTCTCTATTA 3 78 0 

GGCAACCTGG TTCACTTAAA AGTAGGTGGA AATCT AC CAC CACCAATGTA GGAGAGCACC 3840 

TTGTGTCTCT TCATCTGGGG AGTGGAGATA CAACTAACAA TCCTTCATCT AGGGAGGGAG 3 900 

ACTTATGTGG GGACCTGAAG CAATTTGAGA GTACAGCTGA GAACAAGAAA CCATACAAAA 3 96 0 

GGAAAATATG CATATTTTTT AGCCGTAGAA AATACTTGGT TGTGTATGCA TGTGTTATTA 4 0 20 

TGACTATATA GTGTTATTAC TATATCTTTA ATGATATAGT ACAGTTCTGT ATTTAATCTG 4 080 

TTGCCCCACC TGCAGCTGTT AATTGCTCAG AAAAT GAG C C TCTGTGGTGG CAAAATGTTG 414 0 

TCTTATTTAT CCGTGTTTTA ACACTGATAT ATATCTCTGG TTTGTTCTGA TACTACAGGA 4200 

AGAATGATTT TATTTCCAGA ATCTTACTGT TGCTCCAAGT TCTCCTTTTT TTTTAAAAAT 42 6 0 

GAAAAGTTTA GTTTGGGCTA TCCAGTAGCA GCTGTTGGAG CATTTGTGCT CCAGCAAGGA 4320 

GTTATGGTGT CTGGCTTTGT GTTTCTGTTC TAGGCTTGTT GGTAGAGAAT GGCATTGCCA 4 3 80 

GCTCTGCATT TTATAGCATA TTTCAAATAT TTATATTTAG CAGTTTGCCC CGTTTTCATT 4440 

CCTTGTTACA GCTCAAATAA AATGAGAGCT TTTACTTGTA ACCCTTTTTC TTCCATGAAG 4 500 

CTTTTATTGA CCCAGCAATC TGATTTCTGA TTATTTGCCT AATTAGTTGC CTTATTAAAG 4 56 0 

CTCACTCTTC TTTCTTCTGG AAAAAGTACC TTCTGGAATA ATGTCGGCCC TTAAGAAAAT 462 0 



^ISDOCID: <WO 9601468A1_I_> 



WO 98/01468 



POVUS97/11311 



- 58 - 

GATGAAAATT ACTGAAATTC TCAAGATTTT AACTATGAGA CCATTAGAGA GTTGGTATTT 45 8 0 

GAGTTACAAC TTTGATGTCT CAGATGTGAA TGTTTGGCGT CTCCATTCTT CTGCACCTTC 4 74 0 

AGTAGCAATA AAACATTAAT GTCCTGTAAA GGTT AATTC C TTTTCTTTGA GACCTTACCA 4 8 00 

CTGTCAAATA GGTTCTTCCA AG AC C AC ATT CCTCTGTGTC TCCTTGCCTG TCTGTAAGGT 486 0 

GATACAGTGA TAACGTGTCT GGGGAGAGTT TGAGTGCCAC AACTCTCCCA TAAAAAGTTT 4 92 0 

CTTATTTAGA AGAAAAAGGA AATAATATTA TAGGAGTGGA GTAAAGTTAA ACCAGGTGAG 4 98 0 

TTGTGCTAAA ATGGCATACT TGGGAAGTTG TCCAAGTCCA AATAAAG 502 7 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Ala Pro Cys Arg Ala Ala Cys Leu Leu Pro Leu Leu Val Ala Val 
1 5 io 15 

Ala Ser Ala Gly Leu Gly Gly Tyr Phe Gly Thr Lys Ser Arg Tyr Glu 
20 25 30 

Glu Val Asn Pro His Leu Ala Glu Asp Pro Leu Ser Leu Gly Pro His 
35 40 45 

i 

' 1: Ala Ala Ala Ala Arg Leu Pro Ala Ala Cys Ala Pro Leu Gin Leu Arq 
50 55 60 

Arg Val Val Arg His Gly Thr Arg Tyr Pro Thr Ala Gly Gin lie Arg 
65 70 75 00 

Arg Leu Ala Glu Leu His Gly Arg Leu Arg Arg Ala Ala Ala Pro Ser 
85 90 95 

Cys Pro Ala Ala Ala Ala Leu Ala Ala Trp Pro Met Trp Tyr Glu Glu 
100 105 no 

Ser Leu Asp Gly Arg Leu Ala Pro Arg Gly Arg Arg Asp Met Glu His 
115 120 125 

Leu Ala Arg Arg Leu Ala Ala Arg Phe Pro Ala Leu Phe Ala Ala Arg 
130 135 140 

Arg Arg Leu Ala Leu Ala Ser Ser Ser Lys His Arg Cys Leu Gin Ser 
145 150 155 160 

Gly Ala Ala Phe Arg Arg Gly Leu Gly Pro Ser Leu Ser Leu Gly Ala 
165 170 175 
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Asp Glu Thr Glu lie Glu Val Asn Asp Ala Leu Met Arg Phe Phe Asp 
180 185 190 

His Cys Asp Lys Phe Val Ala Phe Val Glu Asp Asn Asp Thr Ala Met 
195 200 205 

Tyr Gin Val Asn Ala Phe Lys Glu Gly Pro Glu Met Arg Lvs Val Leu 
210 215 220 

Glu Lys Val Ala Ser Ala Leu Cys Leu Pro Ala Ser Glu Leu Asn Ala 
225 230 235 240 

Asp Leu Val Gin Val Ala Phe Leu Thr Cys Ser Tyr Glu Leu Ala lie 
245 250 255 

Lys Asn Val Thr Ser Pro Trp Cys Ser Leu Phe Ser Glu Glu Asp Ala 
260 265 270 

Lys Val Leu Glu Tyr Leu Asn Asp Leu Lys Gin Tyr Trp Lys Arg Gly 
275 280 285 

Tyr Gly Tyr Asp lie Asn Ser Arg Ser Ser Cys lie Leu Phe Gin Asp 
290 295 300 

lie Phe Gin Gin Leu Asp Lys Ala Val Asp Glu Ser Arg Ser 
305 310 * 315 

(2) INFORMATION .FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 



GGCACGAAGG 


GAGGCGAGAG 


GATCCCGGAG 


CAGCTGGAGC 


AGGCGGCCGC 


GCCCGTCCTC 


60 


CTCTTCCTGC 


AGCTGCCGCC 


ATGGCGCCGT 


GCCGCGCTGC 


CTGTCGTCTG 


CCGCTTCTGG 


120 


TAGCGGTGGC 


GAGCGCCGGG 


CTGGGCGGCT 


ACTTCGGCAC 


CAAGTCCCGC 


TACGAGGAGG 


180 


TGAACCCGCA 


CCTGGCGGAG 


GACCCGCTGT 


CCCTCGGGCC 


GCACGCCGCC 


GCCGCCCGGC 


240 


TGCCCGCCGC 


CTGCGCCCCG 


CTGCAGCTCC 


GCCGCGTCGT 


CCGCCACGGC 


ACCCGCTACC 


300 


CCACGGCCGG 


GCAAATCCGC 


CGCCTGGCCG 


AGCTGCACGG 


CCGCCTCCGC 


CGCGCCGCCG 


360 


CCCCGTCCTG 


CCCCGCCGCC 


GCCGCGCTGG 


CCGCCTGGeC 


GATGTGGTAC 


GAGGAGAGCC 


420 


TCGACGGGCG 


GCTGGCGCCG 


CGGGGCCGCC 


GCGACATGGA 


ACACCTGGCG 


CGCCGCCTGG 


480 


CCGCCCGCTT 


CCCCGCGCTC 


TTCGCCGCCC 


GCCGCCGCCT 


GGCGCTGGCC 


AGCAGCTCCA 


540 


AGGACCGCTG 


CCTGCAGAGC 


GGCGCGGCCT 


TCCGGCGCGG 


CCTCGGGCCC 


TCCCTCAGCC 


600 


TCGGCGCCGA 


CGAGACGGAG 


ATCGAAGTGA 


ACGACGCGCT 


GATGAGGTTT 


TTTGATCACT 


660 
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GCGACAAGTT CGTGGCCTTC GTGGAGGACA ACGACACAGC CATGTACCAA GTGAACGCCT 720 

TCAAAGAGGG CCCGGAGATG AGGAAGGTGT TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC 780 

CGGCCAGCGA GCTGAACGCA GATCTCGTTC AAGTGGCTTT CCTCACTTGC TCGTATGAGT 840 

TGGCTATAAA AAATGTGACC TCCCCGTGGT GTTCGCTCTT CAGTGAAGAA GATGCTAAGG 900 

TACTGGAGTA CCTGAATGAC CTGAAGCAAT ACTGGAAGAG AGGATATGGC TATGACATCA 96 0 

ATAGTCGCTC CAGCTGCATT TTATTCCAGG ATATCTTCCA GCAGTTGGAC AAAGCAGTGG 10 20 

ATGAGAGCAG AAGTTGACAG ATTGAAAATA GAGGTAGCCT TGCAATTTTG GATCAGAGGA 1080 

ATGATCTATC AAATTGTGAA GTCTTCCTCC TTGGAAGAAA AGCTTCAAAA GCTGCCCTGG 114 0 

CACTACCCTG GGATACAGCC TCCAGAGGTC CCTTCCCACC TCAAGCATTC TGTAACGCCA 12 00 

ATCACTTCTT ACAAAGAGGA CTGCGAAGAA GTTGTTCATC TAGATTTTTG CTCACTGAGG 12 6 0 

ATCTGAGTTA AATATCAACA GTGATAGAAC TGACTGTTAA GTCAGTTGAA GCAGAATTCT 13 2 0 

CAGTCAGTTG GCTTTTTTGT TGTGCTTCAG TGCTGGATGC AGAGATGCTG TGTGTTAAGC 13 80 

j| CCTCTTCATT TTGCTAT G AA CAGGCTAGAA CTTGTTGTAA GCTAGTTGTA AGCATGAAAC 1440 

CAACATAGCA CCGAGGACTA ATTGTGAAGG AAAGGTGGGC AGAAGGAAGT GGCTGTTGAT 15 00 

AGCAAACTCT CTGCAGCAAG CCTGGACATT GTGCTGCTAA ATCATTCTGG TTTTTGGAAA 15 60 

TCTAAGGGCT GTCAGAGCTG TTGATCCCTC TCATTTTGAG AGTGGTGGAG TCAAAGCTGT 16 20 

GGTT ATG C T A GATTGCCCTT T AAATAAAT C TCTACTGTAT CCTTTCTTCA GCATTCTGGG 16 8 0 

AAG CTAAATA AAAAATG CAT GAGGCCACAG GTCATTTACA TCCAACTGTG AAGAGATTGA 17 40 

CAAGCACACT GCTGTGATTG CTTCCATATA TGCTGTGTCT GCTTCTGCGA AGATAGAAAA 18 00 

TAIAAACAGA ATGAGGAGAC G AAG AG C AG A TTAAAAGTGA G CAG AC AAG C AGAGCAAAAC 18 60 

CCCTCTGCCC TTCTGAAGGA AAAAAAAATA ACTTCTTAAT GTAGCTTGTC TCATATAAGG 19 2 0 

AGAATAATTA GATCTATTTG CTTTTAGTGT ATTTATTCTA TGAGCAGGGA AAGCCTTTAA 19 80 

ATCCTTAAGT GCTACTTAGA AAATAGCTTT AATTCTTAAC TGTTTATTAA GTCTGTAAGT 2 04 0 

TTAATAATGA TAAAGCTATA ATTGACAAAA TCCACATCTG TACTTCCAGT TTATTGACAG 2100 

CTCATTCAGC AGCCCCTAAA TTTCTTGGGA AGAGCAGGTG TTGGAGGCAG AGCAGTAAAA 2160 

GATTGAGATG ATCTCATCCT GTCTTAGAGC TTTGGCCATG GAATCAGAAT CACAGAATAT 2220 

CCCAAGTTTG GAG 22 3 3 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



ATGGCGCCGT 


GCCGCGCTGC 


CTGTCGTCTG 


CCGCTTCTGG 


TAGCGGTGGC 


GAGCGCCGGG 


60 


CTGGGCGGCT 


ACTTCGGCAC 


CAAGTCCCGC 


TACGAGGAGG 


TGAACCCGCA 


CCTGGCGGAG 


120 


GACCCGCTGT 


CCCTCGGGCC 


GCACGCCGCC 


GCCGCCCGGC 


TGCCCGCCGC 


CTGCGCCCCG 


180 


CTGCAGCTCC 


GCCGCGTCGT 


CCGCCACGGC 


ACCCGCTACC 


CCACGGCCGG 


GCAAATCCGC 


240 


CGCCTGGCCG 


AGCTGCACGG 


CCGCCTCCGC 


CGCGCCGCCG 


CCCCGTCCTG 


CCCCGCCGCC 


300 


GCCGCGCTGG 


CCGCCTGGCC 


GATGTGGTAC 


G AGG AG AG C C 


TCGAGGGGCG 


GCTGGCGCCG 


360 


CGGGGCCGCC 


GCGACATGGA 


ACACCTGGCG 


CGCCGCCTGG 


CCGCCCGCTT 


CCCCGCGCTC 


420 


TTCGCCGCCC 


GCCGCCGCCT 


GGCGCTGGCC 


AGCAGCTCCA 


AGCACCGCTG 


C C TG C AG AG C 


460 


GGCGCGGCCT 


TCCGGCGCGG 


CCTCGGGCCC 


TCCCTCAGCC 


TCGGCGCCGA 


CGAGACGGAG 


540 


ATCGAAGTGA 


ACGACGCGCT 


GATGAGGTTT 


TTTGATCACT 


GCGACAAGTT 


CGTGGCCTTC 


600 


GTGGAGGACA 


ACGACACAGC 


CATGTACCAA 


GTGAACGCCT 


TCAAAGAGGG 


CCCGGAGATG 


660 


AGGAAGGTGT 


TGGAGAAGGT 


GGCGAGTGCC 


CTGTGTCTGC 


CGGCCAGCGA 


G CTG AACGC A 


720 


GATCTCGTTC 


AAGTGGCTTT 


CCTCACTTGC 


TCGTATGAGT 


TGGCTATAAA 


AAATGTGACC 


780 


TCCCCGTGGT 


GTTCGCTCTT 


CAGTGAAGAA 


GATGCTAAGG 


TACTGGAGTA 


CCTGAATGAC 


840 


CTGAAGCAAT 


ACTGGAAGAG 


AGGATATGGC 


TATGACATCA 


ATAGTCGCTC 


CAGCTGCATT 


9C0 


TTATTCCAGG 


ATATCTTCCA 


GCAGTTGGAC 


AAAGCAGTGG 


ATGAG AG C AG 


AAGT 


954 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO:6: 

GGCACGAAGG GAGGCGAGAG GATCCCGGAG CAGCTGGAGC AGGCGGCCGC GCCCGTCCTC 60 

CTCTTCCTGC AGCTGCCGCC ATGGCGCCGT GCCGCGCTGC CTGTCGTCTG CCGCTTCTGG 12 0 

TAGCGGTGGC GAGCGCCGGG CTGGGCGGCT ACTTCGGCAC CAAGTCCCGC TACGAGGAGG 18 0 

TGAACCCGCA C CTGGCGG AG GACCCGCTGT CCCTCGGGCC GCACGCCGCC GCCGCCCGGC 24 0 

TGCCCGCCGC CTGCGCCCCG CTGCAGCTCC GCCGCGTCGT CCGCCACGGC ACCCGCTACC 3 00 

CCACGGCCGG GCAAATCCGC CGCCTGGCCG AGCTGCACGG CCGCCTCCGC CGCGCCGCCG 350 

CCCCGTCCTG CCCCGCCGCC GCCGCGCTGG CCGCCTGGCC GATGTGGTAC GAGGAGAGCC 420 

TCGACGGGCG GCTGGCGCCG CGGGGCCGCC GCGACATGGA ACAC CTGGCG CGCCGCCTGG 480 

CCGCCCGCTT CCCCGCGCTC TTCGCCGCCC GCCGCCGCCT GGCGCTGGCC AGCAGCTCCA 54 0 

AGCACCGCTG CCTGCAGAGC GGCGCGGCCT TCCGGCGCGG CCTCGGGCCC TCCCTCAGCC 6 00 

TCGGCGCCGA CGAGACGGAG ATCGAAGTGA ACGACG CGCT GATGAGGTTT TTTGATCACT 660 

GCGACAAGTT CGTGGCCTTC GTGGAGGACA ACGACACAGC CATGTACCAA GTGAACGCCT 72 0 

TCAAAGAGGG CCCGGAGATG AGGAAGGTGT TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC 780 

CGGCCAGCGA GCTGAACGCA GATCTCGTTC AAGTGGCTTT CCTCACTTGC TCGTATGAGT 640 

TGGCTATAAA AAATGTG AC C TCCCCGTGGT GTTCGCTCTT CAGTGAAGAA GATGCTAAGG 9 00 

TACTGGAGTA CCTGAATGAC CTGAAGCAAT ACTGGAAGAG AGGATATGGC TATGACATCA 96 0 

ATAGTCGCTC C AG CTG C ATT TTATTCCAGG ATATCTTCCA GCAGTTGGAC AAAGCAGTGG 1020 

ATGAGAGCAG AAGTTCAAAA CCCATTTCTT CACCTTTGAT TGTACAAGTT GGACATGCAG 1080 

AAACACTTCA GCCACTTCTT GCTCTTATGG GCTACTTCAA AGATGCTGAG CCTCTCCAGG 114 0 

CCAACAATTA CATCCGCCAG GCGCATCGGA AGTTCCGCAG CGGCCGGATA GTGCCTTATG 12 0 0 

CAGCCAACCT GGTGTTTGTG CTGTACCACT GTGAGCAGAA GACCTCTAAG GAGGAGTACC 126 0 

AAGTGCAGAT GTTGCTGAAT GAAAAGCCAA TGCTCTTTCA TCACTCGAAT GAAACCATCT 132 0 

CCACGTATGC AGACCTCAAG AGCTATTACA AGGACATCCT TCAAAACTGT CACTTCGAAG 138 0 

AAGTGTGTGA ATTGCCCAAA GTCAATGGTA CCGTTGCTGA CGAACTTTGA GGGAATGAAA 144 0 

TGGAGTGGCC GATTTGGAAA CCGATCTCAG TTTTCTTCAA CAGATGTTGT GAACGAGCAC 15 00 

TTTGGATGCA ATGCTGCTGC TGTGCCGACT CTCTAAGCTC GCAGATTTGA CGGCCGTTAT 1560 

TTACCTGGGT TGTCTCTGTC AGCTCAA 15 8 7 
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(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



Met Ala Pro Cys Arg Ala Ala Cys Leu Leu Pro Leu Leu Val Ala Val 

1 5 10 15 

Ala Ser Ala Gly Leu Gly Gly "Tyr Phe Gly Thr Lys Ser Arg Tyr Glu 
20 25 30 

Glu Val Asn Pro His Leu Ala Glu Asp Pro Leu Ser Leu Gly Pro His 
35 40 45 

Ala Ala Ala Ala Arg Leu Pro Ala Ala Cys Ala Pro Leu Gin Leu Arg 
SO 55 60 

Arg Val Val Arg His Gly Thr Arg Tyr Pro Thr Ala Gly Gin He Arg 
65 70 75 80 

Arg Leu Ala Giu Leu His Gly Arg Leu Arg Arg Ala Ala Ala Pro Ser 
85 90 95 



Cys Pro Ala Ala Ala Ala Leu Ala Ala Trp Pro Met Trp Tyr Glu Glu 
100 105 110 

Ser Leu Asp Gly Arg Leu Ala Pro Arg Gly Arg Arg Asp Met Glu His 
115 120 125 



Leu Ala Arg Arg Leu Ala Ala Arg Phe Pro Ala Leu Phe Ala Ala Arg 
130 135 140 

Arg Arg Leu Ala Leu Ala Ser Ser Ser Lys His Arg Cys Leu Gin Ser 
145 150 155 160 

Gly Ala Ala Phe Arg Arg Gly Leu Gly Pro Ser Leu Ser Leu Gly Ala 
165 170 175 



Asp Glu Thr Giu lie Glu Val Asn Asp Ala Leu Met Arg Phe Phe Asp 
180 185 190 

His Cys Asp Lys Phe Val Ala Phe Val Glu Asp Asn Asp Thr Ala Met 
195 200 205 

Tyr Gin Val Asn Ala Phe Lys Glu Gly Pro Glu Met Arg Lys Val Leu 
210 215 220 

Glu Lys Val Ala Ser Ala Leu Cys Leu Pro Ala Ser Glu Leu Asn Ala 
225 ' 230 235 240 

Asp Leu Val Gin Val Ala Phe Leu Thr Cys Ser Tyr Glu Leu Ala He 
245 250 255 
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Lys Asn Val Thr Ser Pro Trp Cys Ser Leu Phe Ser Glu Glu Asp Ala 
260 255 270 

Lys Val Leu Glu Tyr Leu Asn Asp Leu Lys Gin Tyr Trp Lys Arg Gly 
275 280 285 

Tyr Gly Tyr Asp lie Asn Ser Arg Ser Ser Cys lie Leu Phe Gin Asx> 
290 295 300 

He Phe Gin Gin Leu Asp Lys Ala Val Asp Glu Ser Arg Ser Ser Lys 
305 310 315 320 

Pro He Ser Ser Pro Leu He Val Gin Val Gly His Ala Glu Thr Leu 
325 330 335 

Gin Pro Leu Leu Ala Leu Met Gly Tyr Phe Lys Asp Ala Glu Pro Leu 
340 345 350 

Gin Ala Asn Asn Tyr He Arg Gin Ala His Arg Lys Phe Arg Ser Gly 
355 3S0 365 

Arg He Val Pro Tyr Ala Ala Asn Leu Val Phe Val Leu Tyr His Cys 
370 375 380 

Glu Gin Lys Thr Ser Lys Glu Glu Tyr Gin Val Gin Met Leu Leu Asn 
385 390 395 400 

Glu Lys Pro Met Leu Phe His His Ser Asn Glu Thr He Ser Thr Tyr 
405 410 415 

Ala Asp Leu Lys Ser Tyr Tyr Lys Asp Xle Leu Gin Asn Cys His Phe 
420 425 430 

Glu Glu Val Cys Glu Leu Pro Lys Val Asn Gly Thr Val Ala Asp Glu 
435 440 445 

Leu 

i 

'I. 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

ATGGCGCCGT GCCGCGCTGC CTGTCGTCTG CCGCTTCTGG TAGCGGTGGC GAGCGCCGGG 6 0 

CTGGGCGGCT ACTTCGGCAC CAAGTCCCGC TACGAGGAGG TGAACCCGCA CCTGGCGGAG 12 0 

GACCCGCTGT CCCTCGGGCC GCACGCCGCC GCCGCCCGGC TGCCCGCCGC CTGCGCCCCG 18 0 

CTGCAGCTCC GCCGCGTCGT CCGCCACGGC ACCCGCTACC CCACGGCCGG GCAAATCCGC 240 

CGCCTGGCCG AGCTGCACGG CCGCCTCCGC CGCGCCGCCG CCCCGTCCTG CCCCGCCGCC 3 00 
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GCCGCGCTGG CCGCCTGGCC GATGTGGTAC GAGGAGAGCC TCGACGGGCG GCTGGCGCCG 360 

CGGGGCCGCC GCGACATGGA ACACCTGGCG CGCCGCCTGG CCGCCCGCTT CCCCGCGCTC 420 

TTCGCCGCCC GCCGCCGCCT GGCGCTGGCC AGCAGCTCCA AGCACCGCTG CCTG CAGAGC 480 

GGCGCGGCCT TCCGGCGCGG CCTGGGGCCC TCCCTCAGCC TCGGCGCCGA CGAGACGGAG 540 

ATCGAAGTGA ACGACGCGCT GATGAGGTTT TTTGATCACT GCGACAAGTT CGTGGCCTTC 6 00 

GTGGAGGACA ACGACACAGC CATGTACCAA GTGAACGCCT TCAAAGAGGG CCCGGAGATG 660 
AGGAAGGTGT TGGAGAAGGT GGCGAGTGCC CTGTGTCTGC CGGCCAGCGA GCTGAACGCA ' 720 

GATCTCGTTC AAGTGGCTTT CCTCACTTGC TCGTATGAGT TGGCTATAAA AAATGTGACC 7 BO 

TCCCCGTGGT GTTCGCTCTT CAGTGAAGAA GATGCTAAGG TACTGGAGTA CCTGAATGAC 840 

CTGAAGCAAT ACTGGAAGAG AGGATATGGC TATGACATCA ATAGTCGCTC CAGCTGCATT 900 

TTATTCCAGG ATATCTTCCA GCAGTTGGAC AAAGCAGTGG ATGAGAGCAG AAGTTCAAAA 960 

CCCATTTCTT CACCTTTGAT TGTACAAGTT GGACATGCAG AAACACTTCA GCCACTTCTT 102 0 

GCTCTTATGG GCTACTTCAA AGATGCTGAG CCTCTCCAGG CCAACAATTA CATCCGCCAG 1080 

GCGCATCGGA AGTTCCGCAG CGGCCGGATA GTGCCTTATG CAGCCAACCT GGTGTTTGTG 114 0 

CTGTACCACT GTGAGCAGAA GACCTCTAAG GAGGAGTACC AAGTGCAGAT GTTGCTGAAT 1200 

GAAAAGCCAA TGCTCTTTCA TCACTCGAAT GAAACCATCT CCACGTATGC AGACCTCAAG 125 0 

AG CTATTAC A AGGACATCCT TCAAAACTGT CACTTCGAAG AAGTGTGTGA ATTGCCCAAA 13 20 

GTCAATGGTA CCGTTGCTGA CGAACTT 134 7 
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WHAT IS CLAIMED; 

1. An isolated protein or polypeptide selectively 
expressed in chondrocytes in lower proliferative or upper 
hypertrophic zones of long hone and embryonic vertebrae growth 
plates . 

2 . An isolated protein or polypeptide according to 
claim 1, wherein said protein or polypeptide is substantially 
undetectable in articular cartilage or brain tissue. 

3 . An isolated protein or polypeptide according to 
claim 1, wherein said protein or polypeptide has a molecular 
weight of from about 34 to about 40 kDa . 

4. An isolated protein or polypeptide according to 
claim 3, wherein the protein or polypeptide comprises an amino 
acid sequence corresponding to SEQ. ID. No. 3. 

5 . An isolated protein or polypeptide according to 
claim 1, wherein said protein has a molecular weight of from 
about 4 7 to about 53 kDa. 

6. An isolated protein or polypeptide according to 
claim 5, wherein the protein or polypeptide comprises an amino 
acid sequence corresponding to SEQ. ID. No. 7. 

7. An isolated protein or polypeptide according to 
claim 1, wherein the protein or polypeptide is purified. 

8. An isolated protein or polypeptide according to 
claim 1, wherein the protein or polypeptide is recombinant. 

9. An ' isolated DNA molecule encoding a protein or 
polypeptide according to claim 1: 
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10. An isolated DNA molecule according to claim 9, 
wherein the protein or polypeptide is substantially undetectable 
in articular cartilage or brain tissue. 

11. An isolated DNA molecule according to claim 9, 
wherein the protein or polypeptide has a molecular weight of from 
about 3 4 to about 4 0 kDa . 

12. An isolated DNA molecule according to claim 11, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ . ID. No. 3. 

13. An isolated DNA molecule according to claim 12, 
wherein said DNA molecule comprises a nucleotide sequence 
corresponding to SEQ. ID. No. 2. 

14. An isolated DNA molecule according to claim 12, 
wherein said DNA molecule comprises a nucleotide sequence 
corresponding to SEQ. ID. No. 4. 

15. An isolated DNA molecule according to claim 12, 
wherein said DNA molecule comprises a nucleotide sequence 
corresponding to SEQ. ID. No. 5. 

16. An isolated DNA molecule according to claim 9, 
wherein the protein or polypeptide has a molecular weight of from 
about 4 7 to about 53 kDa. 

17. An isolated DNA molecule according to claim 16, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ. ID. No. 7. 

18. An isolated DNA molecule according to claim 17, 
wherein said DNA molecule comprises a nucleotide sequence 
corresponding to SEQ. ID. No. 6. 
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is. An isolated DNA molecule according to claim 17, 
wherein said DNA molecule comprises a nucleotide sequence 
corresponding to SEQ. ID. No. 8. 

20. An expression system comprising a DNA molecule 
according to claim 9. 

21. An expression system according to claim 20, 
wherein the protean or polypeptide has a molecular weight of from 
about 3 4 to about 4 0 kDa . 

22. An expression system according to claim 21, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ. ID. No. 3. 

23. An expression system according to claim 20, 
wherein the protein or polypeptide has a molecular weighc of from 
about 47 to about 53 kDa. 

24. An expression system according to claim 23, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ . ID. No. 7. 

25. A host cell transformed with a heterologous DNA 
molecule according to claim 9. 

26. A host cell according to claim 25, wherein the DNA 
molecule is inserted into a heterologous expression system. 

27. A host cell according to claim 25, wherein the 
protein or polypeptide has a molecular weight of from about 34 to 
about 4 0 kDa . 
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28. A host cell according to claim 27, wherein the 
protein or polypeptide comprises an amino acid sequence 
corresponding to SEQ. ID. No. 3. 

29. A host cell according to claim 25, wherein the 
protein or polypeptide has a molecular weight of from about 47 to 
about 53 kDa. 

30. A host cell according to claim 29, wherein the 
protein or polypeptide comprises an amino acid sequence 
corresponding to SEQ. ID. No. 7. 

31. An isolated antibody, binding portion thereof, or 
probe against a protein or polypeptide according to claim I . 

32. An isolated antibody, binding portion thereof, or 
probe according to claim 31, wherein the protein or polypeptide 
is substantially undetectable in articular cartilage or brain 
tissue. 

33. An isolated antibody, binding portion thereof, or 
probe according to claim 31, wherein the protein or polypeptide 
has a molecular weight of from about 3 4 to about 4 0 kDa. 

34. An isolated antibody, binding portion thereof, or 
probe according to claim 31, wherein the protein or polypeptide" 
has a molecular weight of from about 47 to about 53 kDa. 

35. An isolated antibody, binding portion thereof, or 
probe according to claim 31, wherein the antibody is polyclonal 
or monoclonal . 

36. A method for identifying the occurrence of 
proliferation or hypertrophy of chondrocytes in a tissue sample 
comprising : 
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providing an isolated antibody, binding portion 
thereof, or probe according to claim 31; 

contacting the sample with the isolated antibody, 
binding portion thereof, or probe; and 

detecting any reaction which indicates that an 
isolated protein or polypeptide selectively expressed in 
chondrocytes in lower proliferative or upper hypertrophic zones 
of long bone and embryonic vertebrae growth plates is present in 
the sample using an assay system. 

37. A method according to claim 36, wherein the assay 
system is selected from the group consisting of an enzyme- linked 
immunosorbent assay, a radioimmunoassay, a gel diffusion 
precipitation reaction assay, an immunodiffusion assay, an 
agglutination assay, a fluorescent immunoassay, a protein A 
immunoassay, and an Immunoelectrophoresis assay. 

38. A method for identifying the occurrence of 
proliferation or hypertrophy of chondrocytes in a tissue sample 
comprising : 

providing a nucleotide, sequence of the DNA 
molecule according to claim 9 as a probe in a nucleic acid 
hybridization assay; 

contacting the sample with the probe; and 
detecting any reaction which indicates that an 
isolated protein or polypeptide selectively expressed in 
chondrocytes in lower proliferative or upper hypertrophic zones 
of long bone and embryonic vertebrae growth plates is present in 
the sample using an assay system. 

39. A method according to claim 38, wherein the assay 
system is selected from the group consisting of a Southern Blot, 
a Northern Blot, an RNAase protection assay, and Colony blot. 
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40. A method for identifying the occurrence of 
proliferation or hypertrophy of chondrocytes in a tissue sample 
comprising : 

providing a nucleotide sequence of the DNA 
molecule according to claim 9 as a probe in a gene amplification 
detection procedure; 

contacting the sample with the probe; and 
detecting any reaction which indicates that an 
isolated protein or polypeptide selectively expressed in 
chondrocytes in lower proliferative or upper hypertrophic zones 
of long bone and embryonic vertebrae growth plates is present in 
the sample using an assay system. 

41. A method for preventing chondrocytes from 
transitioning from proliferation to hypertrophy | comprising : 

reducing expression of a protein or polypeptide 
according to claim 1 in the chondrocytes. 

42. A method for inducing chondrocytes to transition 
from proliferation to hypertrophy comprising: 

increasing expression of a protein or polypeptide 
i. t according to claim 1 in the chondrocytes. 

43. A method for inhibiting arthritic progression of 
articular chondrocytes in a patient comprising: 

administering an effective amount of an antibody,^ 
binding portion thereof, or probe according to claim 31 to the 
patient. 

44. A method for treating non-union bone defects in a 
patient comprising: 

' administering an effective amount of a protein or 
polypeptide according to claim 1 to the patient . 
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45. A method for treating non-union bone defects in a 
patient comprising: 

administering an effective amount of a DNA 
molecule according to claim 9 . 
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